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Description 

Background of the invention 

5 [0001] The pharmaceutical, the fine chemicals and the food industry need a number of compounds that have to be 
isolated from complex mixtures such as extracts of animal or plant tissue, or fermentation broth. Often these isolation 
processes determine the price of the product. 

[0002] Conventional isolation processes are not very specific and during the isolation processes the compound to 
' be isolated is diluted considerably with the consequence that expensive steps for removing water or other solvents 
10 have to be applied. 

[0003] For the isolation of some specific compounds affinity techniques are used. The advantage of these techniques 
is that the compounds bind very specifically to a certain ligand. However these ligands are quite often very expensive. 
[0004] To avoid spillage of these expensive ligands they can be linked to an insoluble support. However, often linking 
the ligand is also expensive and, moreover, the functionality of the ligand is often affected negatively by such procedure. 
15 [0005] So a need exists for developing cheap processes for preparing highly effective immobilized ligands. 

[0006] The non-prepublished co-pending international patent application WO 94/01567 (UNILEVER) discloses a 
method for immobilizing an enzyme or functional part thereof on the cell wall of a lower eukaryote, whereby the enzyme 
or functional part thereof is localized at the exterior of the cell wall. 

[0007] WO 92/20805 discloses the production of heterologous proteins in Gram-positive bacteria wherein the het- 
20 erologous proteins are bound to the surface of the bacteria by a cell wall spanning and membrane anchoring amino 
acid sequence. 

[0008] J.E. Francisco et al, Proc. Natl. Acad. Sci. USA 89 (1992) 2713-2717, disclose a fusion protein obtained by 
the expression of a chimeric gene consisting of (i) the signal sequence and first nine N-terminal amino acids of the 
mature major E. coli lipoprotein, (ii) amino acids 46-159 of the outer membrane OmpA, and (iii) the complete mature 
25 p-lactamase sequence. 

[0009] The fusion protein had an enzymatically active p-lactamase and was found predominantly in the outer mem- 
brane. A substantial fraction (20-30%) of the p-lactamase domain of the protein was exposed on the external surface 
ofE. coli. 

[0010] In J. Bacterid. 171 (1989) 4569-4576, the anchoring of p-galactosidase to the E. coli inner plasma membrane 
30 is disclosed. It is suggested that by combining a signal sequence at the N-terminus with the hydrophobic domain at 
the N-terminus, many proteins, although not p-galactosidase, could become associated with the outer aspect of the 
inner membrane or with the outer membrane. Such modified bacteria might have potential use as immobilized enzyme 
systems. 

[0011] S.W. Hiebert et al, J. Cell Biology 107 (1988) 865-876, describe that the soluble cytoplasmic protein pyruvate 
35 kinase (PK) has been expressed at the cell surface of CV1 cells in a membrane-anchored form (APK). Truncated forms 

of the APK molecule, with up to 80% of the PK region of APK removed, can also be expressed at the cell surface. 

[0012] In WO 89/07140 a tripartite DNA sequence is described that is so fused that the gene will upon expression 

in a eukaryotic cell (exemplified are COS cells) give rise to a correctly processed, appropriately folded, membrane 

bound version of the protein. Various anchor domains are mentioned. 
40 [0013] C.C. Chen et al, J. Biol. Chem. 265 (1990) 3161-3167, describe that the predicted amino acid sequence 

reveals the presence of a protine-rich cell wall anchor region similar to that of other Gram-positive surface proteins. 

[0014] Vijaya et al, Mol. Cell. Biol. 8 (1988) 1709-1714, describe the cell surface localization of hybrid proteins on 

the outer surface of CV-1 cells infected by recombinant viruses. 

45 Summary of the invention 

[0015] The invention is a process for carrying out an isolation process by using an immobilized binding protein or 
functional part thereof still capable of binding to a specific compound, wherein a medium containing said specific com- 
pound is contacted with a host cell according to the invention under conditions whereby a complex between said specific 
50 compound and said immobilized binding protein is formed, separating said complex from the medium originally con- 
taining said specific compound and, optionally, releasing said specific compound from said binding protein or functional 
part thereof. 

[0016] Preferably, the host is selected from Gram-positive bacteria and fungi, which have a cell wall at the outside 
of the host cell, in contrast to Gram-negative bacteria and cells of higher eukaryotes such as animal cells and plant 
55 cells, which have a membrane at the outside of their cells. Suitable Gram-positive bacteria comprise lactic acid bacteria 
and bacteria belonging to the genera Bacillus and Streptomyces. Suitable fungi comprise yeasts belonging to the 
genera Candida, Debaryomyces, Hansenula, Kluyveromyces, Pichia and Saccharomyces, and moulds belonging to 
the genera Aspergillus, Penicillium and Rhizopus. In this specification the group of fungi comprises the group of yeasts 
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and the group of moulds, which are also known as lower eukaryotes. In contrast to the cells in plants and animals, the 
group of bacteria and lower eukaryotes are also indicated in this specification as microorganisms. 
[0017] Recombinant polynucleotides capable of being used in a method as described above, are polynucleotides 
comprising (i) a structural gene encoding a binding protein or a functional part thereof still having the specific binding 
5 capability, and (ii) at least part of a gene encoding an anchoring protein capable of anchoring in the cell wall of a Gram- 
positive bacterium or a fungus, said part of a gene encoding at least the anchoring part of said anchoring protein, which 
anchoring part is derivable from the C-terminal part of said anchoring protein. 

[0018] The anchoring protein can be selected from a-agglutinin, a-aggiutinin, FL01 , the Major Cell Wall Protein of 
a lower eukaryote, and proteinase of lactic acid bacteria. Preferably, such polynucleotide further comprises a nucleotide 
10 sequence encoding a signal peptide ensuring secretion of the expression product of the polynucleotide, which signal 
peptide can be derived from a protein selected from the ct-mating factor of yeast, a-agglutinin of yeast, invertase of 
Saccharomyces, inulinase of Kluyveromyces, a-amylase of Bacillus, and proteinase of lactic acid bacteria. The poly- 
nucleotide can be operably linked to a promoter, which is preferably an inducible promoter. 

15 Brief description of the figures 

[0019] 

In Figure 1 the composition of pEMBL9-derived piasmid pUR4122 is indicated, the preparation of which is de- 
20 scribed in Example 1 . 

In Figure 2 the composition of piasmid pUR2741 is indicated, which is a derivative of published piasmid pUR2740, 
see Example 1 . 

In Figure 3 the composition of pEMBL9-derived piasmid pUR2968 is indicated. Its preparation is described in 
Example 1 . 

25 in Figure 4 the preparation of piasmid pUR4174 starting from plasmids pUR2741, pUR2968 and pUR4122 is 

indicated, as well as the preparation of piasmid pUR4175 starting from plasmids pSY16, pUR2968 and pUR4122. 
These preparations are described in Example 1 . 

In Figure 5 the composition of piasmid pUR2743.4 is indicated. Its preparation is described in Example 2. It contains 
the 714 bp Pst\-Xho\ fragment given in SEQ ID NO: 12, which fragment encodes an scFv-TRAS fragment of anti- 

30 traseolide® antibody 02/01/01 . 

In Figure 6 the composition of piasmid pUR4178 is indicated. Its preparation is indicated in Example 2. It contains 
the above mentioned 714 bp Pst\-Xho\ fragment given in SEQ ID NO: 1 2. This piasmid is suitable for the expression 
of a fusion protein between scFv-TRAS and otAGG preceded by the invertase signal sequence (SUC2). 
In Figure 7 the composition of piasmid pUR4179 is indicated. Its preparation is indicated in Example 2. It contains 

35 the above mentioned 71 4 bp Pst\-Xho\ fragment given in SEQ ID NO: 1 2. This piasmid is suitable for the expression 

of a fusion protein between scFv-TRAS and aAGG preceded by the prepro-a-mating factor signal signal sequence. 
In Figure 8 a molecular design picture is given, showing the musk odour molecule traseolide® and a modified 
musk antigen, described in Example 3. 

In Figure 9 the composition of piasmid pUR4177 is indicated. Its construction is described in Example 4. Piasmid 
40 pUR4177 contains the 734 bp Eag\-Xho\ DNA fragment given in SEQ ID NO: 13 encoding the variable regions of 

the heavy and light chain fragments from the monoclonal antibody directed against the human chorionic gonado- 
tropin (an scFv-HCG fragment) and is a 2 um-based vector suitable for production of the chimeric scFv HCG- 
ocAGG fusion protein preceded by the invertase signal sequence and under the control of the GAL7 promoter. 
In Figure 10 the composition of piasmid pUR4180 is indicated. Its preparation is indicated in Example 4. It contains 
45 the above mentioned 734 bp Eag\-Xho\ DNA fragment given in SEQ ID NO: 1 3 and is a 2 um-based vector suitable 

for production of the chimeric scFv-HCG-aAGG fusion protein preceded by the prepro-a-mating factor signal se- 
quence and under the control of the GAL7 promoter. 

In 'Figure 11 the composition of piasmid pUR2990, a 2 um-based vector, is indicated, which is suggested in Ex- 
ample 5 as a starting vector for the preparation of piasmid pUR4196 (see Figure 12). Piasmid pUR2990 contains 
50 a DNA fragment encoding a chimeric lipase-FL01 protein that will be anchored in the cell wall of a lower eukaryote 

and can catalyze lipid hydrolysis. 

In Figure 12 the composition of piasmid pUR41 96 is indicated. Its preparation is explained in Example 5. It contains 
a DNA fragment encoding a chimeric protein comprising the scFv-HCG followed by the C-terminal part of the 
FLOl -protein, and is a vector suitable for the production of a chimeric protein anchored in the cell wall of the host 
55 organism and can bind HCG. 

In Figure 1 3 the composition of piasmid pUR2985 is indicated. Its preparation is described in Example 6. It contains 
a choS gene coding for the mature part of the cholesterol oxidase (EC 1 .1 .3.6) obtained via PCR techniques from 
the chromosome of Brevibacterium sterolicum. 
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In Figure 14 the composition of plasmid pUR2987 is indicated. Its preparation from plasmid pUR2985is described 
in Example 6. It contains a DNA sequence comprising the choB gene coding for the mature part of the cholesterol 
oxidase preceded by DNA encoding the prepro-a-mating factor signal sequence and followed by DNA encoding 
the C-terminal part of ct-agglutinin. 
5 In Figure 15 the composition of the published plasmid pGKV550 is indicated. It is described in Example 7 and 

contains the complete cell wall proteinase operon of Lactococcus lactis subsp. cremoris Wg2, including the pro- 
moter, the ribosome binding site and the prtP gene. 

In Figure 16 the composition of plasmid pUR2988 is indicated. Its preparation is described in Example 7. It is 
anticipated that this plasmid can be used for preparing a further plasmid pUR2989, which after introduction in a 
10 lactic acid bacterium will be responsible for producing a chimeric protein that will be anchored at the outer surface 

of the lactic acid bacterium and is capable of binding cholesterol. 

In Figure 17 the composition of plasmid pUR2993 is indicated. Its preparation is described in Example 8. It is 
anticipated that this plasmid can be used for transforming yeast cells that can bind a human epidermal growth 
factor (EGF) through an anchored chimeric protein containing an EGF receptor. 
15 in Figure 18 the composition of plasmids pUR4482 and 4483 is indicated. Their preparation is described in Example 

9. Plasmid pUR4482 is a yeast episomal expression plasmid for expression of a fusion protein with the invertase 
signal sequence, the CH V 09 variable region, the Myc-tail, and the "X-P-X-P" Hinge region of a camel antibody, 
and the cc-agglutinin cell wall anchor region. Plasmid pUR4483 differs from pUR4482 in that it does not contain 
the "X-P-X-P" Hinge region. 

20 in Figure 19 immunofluorescent labelling (anti-Myc antibody) of SU10 cells in the exponential phase (OD 530 = 

0.5) expressing the genes of camel antibodies present on plasmids pUR4424, pUR4482 and pUR4483 is shown. 
Ph = phase contrast, Fl = fluorescence. 

In Figure 20 immunofluorescent labelling (anti-human IgG antibody) of SU10 cells in the exponential phase 
(OD 530 =0.5) expressing the genes of camel antibodies present on plasmids pUR4424, pUR4482 and pUR4483 is 
25 shown. 

Ph = phase contrast, Fl = fluorescence. 



Abbreviations used in the Figures: 



30 [0020] 





a-gal: 


gene encoding guar a-galactosidase 




AG-alpha-1/AGa1: 


gene expressing ot-agglutinin from S. cerevisiae 




AGa1 cds/a-AGG: 


coding sequence of a-agglutinin 


35 


Amp/amp r: 


p-lactamase resistance gene 




CHv09: 


camel heavy chain variable 09 fragment 




EmR: 


erythromycin resistance gene 




f1: 


phage f1 replication sequence 




FL01/FLO (C-part): 


C-terminal part of FL01 coding sequence of flocculation protein 


40 


Hinge: 


Camel "X-P-X-P" Hinge region, see Example 9 




LEU2: 


LEU2 gene 




LEU2d/Leu2d: 


truncated LEU2 gene 




Leu 2d cs: 


coding sequence LEU2dgene 




MycT: 


camel Myc-tail 


45 


OriMBI: 


origin of replication MB1 derived from E. coli plasmid 




Pga17/pGAL7: 


GAL7 promoter 




Tpgk: 


terminator of the phosphoglyceratekinase gene 




ppa-MF/MFa1ss: 


prepro-part of a-mating factor (= signal sequence) 




repA: 


gene encoding the repA protein required for replication (Fig. 15/16). 


50 


ScFv (Vh-VI): 


single chain antibody fragment containing V H and V L chains 




ss: 


signal sequence 




SUC2: 


invertase signal sequence 




2u/2 micron: 


2um sequence 



55 Detailed description of the invention 



[0021] The present invention relates to the isolation of valuable compounds from complex mixtures by making use 
of immobilized ligands. The immobilized ligands can be proteins obtainable via genetic engineering and can consist 
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of two parts, namely both an anchoring protein or functional part thereof and a binding protein or functional part thereof. 
[0022] The anchoring protein sticks into cell walls of microorganisms, preferably lower eukaryotes, e.g. yeasts and 
moulds. Often this type of proteins has a long C-terminal part that anchors it in the cell wall. These C-terminal parts 
have very special amino acid sequences. A typical example is anchoring via C-terminal sequences of proteins enriched 
5 in proline, see Kok (1990). 

[0023] The C-terminal part of these anchoring proteins can contain a substantial number of potential serine and 
threonine glycosylation sites. O-glycosylation of these sites gives a rod-like conformation to the C-terminal part of these 
proteins. 

In the case of anchored manno-proteins they seem to be linked to the glucan in the cell wall of lower eukaryotes, as 

10 they cannot be extracted from the ceil wall with sodium dodecyl sulphate (SDS), but can be liberated by glucanase 
treatment, see our co-pending patent application WO-94/01 567 (UNILEVER) published 20 January 1 994 and Schreud- 
er c.s. (1993), both being published after the claimed priority date. WO-A-94/01567 discloses the use of anchored 
enzymes in an enzymatic process. Another mechanism to anchor proteins at the outer side of a cell is to make use of 
the property that a protein containing a glycosyl-phosphatidyl-inositol (GPI) group anchors via this GPI group to the 

15 cell surface, see Conzelmann c.s. (1990). 

[0024] The binding protein is so called, because it ligates or binds to the specific compound to be isolated. If the N- 
terminal part of the anchoring protein is sufficiently capable of binding to a specific compound, the anchoring protein 
itself can be used in a process for isolating that specific compound. Suitable examples of a binding protein comprise 
an antibody, an antibody fragment, a combination of antibody fragments, a receptor protein, an inactivated enzyme 

20 still capable of binding the corresponding substrate, and a peptide obtained via Applied Molecular Evolution, see Lewin 
(1990), as well as a part of any of these proteinaceous substances still capable of binding to the specific compound 
to be isolated. All these binding proteins are characterized by specific recognition of the compounds or group of related 
compounds to be isolated. The binding rate and release rate, and therefore the binding constant between the specific 
compound to be isolated and the binding protein, can be regulated either by changing the composition of the liquid 

25 extract in which the compound is present or, preferably, by changing the binding protein by protein engineering. 

[0025] The gene coding for the chimeric protein comprising both the binding protein and the anchoring protein (or 
functional parts thereof) can be placed under control of a constitutive, inducible or derepressible promoter and will 
generally be preceded by a DNA fragment encoding a signal sequence ensuring efficient secretion of the chimeric 
protein. Upon secretion the chimeric protein will be anchored in the cell wall of the microorganisms, thereby covering 

30 the surface of the microorganisms with the chimeric protein. These microorganisms can be obtained in normal fermen- 
tation processes and their isolation is a cheap process, when physical separation processes are used, e.g. centrifu- 
gation or membrane filtration. 

[0026] After washing, the isolated microorganisms can be added to liquid extracts containing the valuable specific 
compound or compounds. After some time the equilibrium between the bound and free specific compound(s) will be 

35 reached and the microorganisms to which the specific compound or group of related compounds is bound can be 
separated from the extract by simple physical techniques. Alternatively, the microorganisms covered with ligands can 
be brought on a support material and subsequently this coated support material can be used in a column. 
[0027] The liquid extract containing the specific compound or compounds of interest can be added to the column 
and afterwards the compound(s) can be released from the ligand by changing the composition of the eluting liquid or 

40 the temperature or both. A skilled person will recognize that in addition to these two possibilities other modifications 
can be used for effecting the binding of the specific compound and the ligand, their subsequent isolation and/or the 
release of the specific compound(s). 

[0028] In particular the invention relates to chimeric proteins that are bound to the cell wall of lower eukaryotes. 
Suitable lower eukaryotes comprise yeasts, e.g. Candida, Debaryomyces, Hansenula, Kluyveromyces, Pichia and 
45 Saccharomyces, and moulds e.g. Aspergillus, Penicillium and Rhizopus. For some applications prokaryotes are also 
applicable, especially Gram-positive bacteria, examples of which include lactic acid bacteria, and bacteria belonging 
to the genera Bacillus and Streptomyces. 

[0029] For lower eukaryotes the present invention provides genes encoding chimeric proteins consisting of: 

50 a. a DNA sequence encoding a signal sequence functional in a lower eukaryotic host, e.g. derived from a yeast 

protein including the cc-mating factor, invertase, ot-agglutinin, inulinase or derived from a mould protein e.g. xyla- 
nase; 

b. a structural gene encoding a C-terminal part of a cell wall protein preceded by a structural gene encoding a 
protein, that is capable of binding to the specific compound or group of compounds of interest, examples of which 
55 include 

an antibody, 

a single chain antibody fragment (scFv; see Bird and Webb Walker (1991), 
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a variable region of the heavy chain (V H ) or a variable region of the light chain (V L ) of an antibody or that part 
of such variable region still containing one to three of the complementarity determining regions (CDRs), 
an agonist-recognizing part of a receptor protein or a part thereof still capable of binding the agonist, 
a catalytically inactivated enzyme, or a fragment of such enzyme still containing a substrate binding site of the 
5 enzyme, 

specific lipid binding proteins or parts of these proteins still containing the lipid binding site(s), see Ossendorp 
(1992), and 

a peptide that has been obtained via Applied Molecular Evolution, see Lewin (1990). 

10 [0030] All expression products of these genes are characterized in that they consists of a signal sequence and both 
a protein part, that is capable of binding to the compound(s) to be isolated, and a C-terminus of a typically eel! wall 
bound protein, examples of the latter including a-agglutinin, see Lipke c.s. (1989), a-agglutinin, see Roy c.s. (1991), 
FL01 (see Example 5 and SEQ ID NO: 14) and the Major Cell Wall Protein of lower eukaryotes, which C-terminus is 
capable of anchoring the expression product in the cell wall of the lower eukaryote host organism. 

15 [0031] The expression of these genes encoding chimeric proteins can be under control of a constitutive promoter, 
but an inducible promoter is preferred, suitable examples of which include the GAL7 promoter from Saccharomyces, 
the inulinase promoter from Kluyveromyces, the methanol-oxidase promoter from Hansenula, and the xylanase pro- 
moter of Aspergillus. Preferably the constructs are made in such a way that the new genetic information is integrated 
in a stable way in the chromosome of the host cell, see e.g. WO-91/00920 (UNILEVER). 

20 [0032] The lower eukaryotes transformed with the above mentioned genes can be grown in normal fermentation, 
continous fermentation, or fed batch fermentation processes. 

[0033] The selection of a suitable process for growing the microorganism will depend on the construction of the gene 
and the promoter used, and on the desired purity of the cells after the physical separation procedure(s). 
[0034] For bacteria the present invention deals with genes encoding chimeric proteins consisting of: 

25 

a. a DNA sequence encoding a signal sequence functional in the specific bacterium, e.g. derived from a Bacillus 
a-amylase, a Bacillus subtilis subtilisin, or a Lactococcus lactis subs p. cremoris proteinase; 

b. a structural gene encoding a C-terminal part of a cell wall protein preceded by a structural gene encoding a 
protein capable of binding to the specific compound or group of compounds of interest, examples of which are 

30 given above for a lower eukaryote. 

[0035] All expression products of these genes are characterized in that they consist of a signal sequence and both 
a protein part, that is capable of binding to the specific compound or specific group of compounds to be isolated, and 
a C-terminus of a typically cell wall-bound protein such as the proteinase of Lactococcus lactis subsp. cremoris strain 
35 Wg2, see Kok c.s. (1988) and Kok (1990), the C-terminus of which is capable of anchoring the expression product in 
the cell wall of the host bacterium. 

[0036] The invention is illustrated with the following Examples without being limited thereto. First the endonuclease 
restriction sites mentioned in the Examples are given. 

40 



45 



50 



BstEII G GTNACC 
CCANTG G 

£coRI G AATTC 
CTTAA G 



Clal 



NotI 
Sad 



GAGCT C 
C TCGAG 



AT CGAT 
TAGC TA 



ifindlll A AGCTT 
TTCGA A 



GC GGCCGC wrui 

CGCCGG CG 



Sail 



TCG CGA 
AGC GCT 

G TCGAC 
CAGCT G 



EagI 
Nhel 
PStI 

Xhol 



C GGCCG 
GCCGG C 

G CTAGC 
CGATC G 

CTGCA G 
G ACGTC 

C TCGAG 
GAGCT C 
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Example 1 . Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of a lower 
eukaryote and is able to bind with high specificity lysozyme from a complex mixture. 

[0037] Lysozyme is an anti-microbial enzyme with a number of applications in the pharmaceutical and food industries. 

5 Several sources of lysozyme are known, e.g. egg yolk or a fermentation broth containing a microorganism producing 
lysozyme. Monoclonal antibodies have been raised against lysozyme, see Ward c.s. (1 989), and the mRNA's encoding 
the light and heavy chains of such antibodies have been isolated from the hybridoma cells and used as template for 
the synthesis of cDNA using reverse transcriptase. Starting from the plasmids as described by Ward c.s. (1989), we 
constructed a pEMBL-derived plasmid, designated pUR4122, in which the multiple cloning site of the pEMBL-vector, 

10 ranging from the EcoRI to the H/ndlll site, was replaced by a 231 bp DNA fragment, whose nucleotide sequence is 
given in SEQ ID NO: 1 and has an EcoRI site (GAATTC) at nucleotides 1-6, a Pstt site (CTGCAG) at nucleotides 
105-110, a BsfEII site (GGTCACC) at nucleotides 122-128, a Xnol site (CTCGAG) at nucleotides 207-212, and a H/ndlll 
site (AAGCTT) at nucleotides 226-231. 

15 Construction of pUR4122 

[0038] Plasmid pEMBL9, see Dente c.s. (1983), was digested with EcoRI and H/ndlll and the resulting large fragment 
was ligated with the double stranded synthetic DNA fragment given in SEO ID NO: 1. For the successive ligation of 
DNA fragments, which finally form the coding sequence of a single chain antibody fragment for lysozyme, the following 

20 elements were combined in the 231 bp DNA fragment (SEQ ID NO: 1 ) inserted into the pEMBL-9 vector: the 3' part of 
the GAL7 promoter, the invertase signal sequence (SUC2), a Pst\ restriction site, a BsfEII restriction site, a sequence 
encoding the (GGGGS)x3 peptide linker connecting the V H and V L fragments, a Sacl restriction site, a Xnol restriction 
site and a H/ndlll restriction site, resulting in plasmid pUR4119. To obtain the in frame fusion between V H and the 
GGGGS-linker plasmid pSW1-VHD1.3-VKD1.3-TAG1, see Ward c.s. (1989), was digested with Psfl and BsfEII and 

25 a DNA fragment of 0.35 kbp was ligated in the correspondingly digested pUR4119 resulting in plasmid pUR4119A. 
Subsequently the plasmid pSW1-VHD1 .3-VKD1 .3-TAG1 was digested with Sacl and Xnol and this fragment containing 
the coding part of V L was finally ligated into the Sac\/Xho\ sites of pUR4119A, resulting in plasmid pUR4122 (see Figure 
1). 

30 Construction of pUR4174, see Figure 4 

[0039] To obtain S. cerevisiae episomal expression plasmids containing DNA encoding a cell wall anchor derived 
from the C-termina! part of a-agglutinin, plasmid pUR2741 (see Figure 2) was selected as starting vector. Basically, 
this plasmid is a derivative of pUR2740, which is a derivative of plasmid pUR2730 as described in WO-91/19782 
35 (UNILEVER) and by Verbakel (1 991 ). The preparation of pUR2730 is clearly described in Example 9 of EP-A1 -02551 53 
(UNILEVER). Plasmid pUR2741 differs from plasmid pUR2740 in that the Eag\ restriction site within the remaining 
part of the already inactive tet resistance gene was deleted through NruVSall digestion. The Sail site was filled in prior 
to religation. 

[0040] After digesting pUR4122 with Sacl (partially) and H/ndlll, the approximately 800 bp fragment was isolated 
40 and cloned into the pUR2741 vector fragment, which was obtained after digestion of pUR2741 with the same enzymes. 
The resulting plasmid was named pUR4125. 

[0041] A plasmid named pUR2968 (see Figure 3) was made by (1 ) digesting with H/ndlll the dgccf-containing plasmid 
pl_a21 published by Lipke c.s. (1 989), (2) isolating an about 6.1 kbp fragment and (3) ligating that fragment with Hindi II- 
treated pEMBL9, so that the 6.1 kbp fragment was introduced into the H/ndlll site present in the multiple cloning site 
45 of the pEMBL9 vector. 

[0042] Plasmid pUR4125 was digested with Xhol and H/ndlll and the about 8 kbp fragment was ligated with the 
approximately 1.4 kbp Nhe\-Hinti\\\ fragment of pUR2968, using Xho\/Nhe\ adapters having the following sequence: 

so XhoZ Nhel 

5 » -TC GAG ATC AAA GGC GGA TCT Q -3 1 SEQ ID NO: 2 

3'- C TAG TTT CCG CCT AGA CGATC -S * « SEQ ID NO: 3. 

[0043] The plasmid resulting from the ligation of the appropriate parts of plasmids pUR2968, pUR4125 and Xnol/ 
55 Nhe\ adapters, was designated pUR4174 and encodes a chimeric fusion protein at the amino terminus consisting of 
the invertase signal (pre) peptide, followed by the scFv-LYS polypeptide and, finally, the C-terminal part of a-agglutinin 
(see Figure 4). 
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Construction of pUR4175, see Figure 4 

[0044] Upon digesting pUR4122 (see above) with Pst\ and H/ndlll, the approximately 700 bp fragment was isolated 
and ligated into a vector fragment of plasmid pSY16, see Harmsen c.s. (1993), which was digested with Eagl and 
H/ndlll and using Eag\-Pst\ adapters, having the following sequence: 

Eagl PstJ 
5'- G GCC G CC CAG GTG CAG CTG CA- 3 1 = SEQ ID NO: 4 

3 CGG GTC CAC GTC £ -5 1 « SEQ ID NO: 5 

[0045] The resulting plasmid, named pUR41 32, was digested with Xnol and H/V?dlll and ligated with the approximately 
1.4 kbp /Wjel-H/ndlll fragment of pUR2968 (see above), using Xho\/Nhe\ adapters as described above, resulting in 
pUR4175 (see Figure 4). This plasmid contains a gene encoding a chimeric protein consisting of the a-mating factor 
prepro-peptide, followed by the scFv-LYS polypeptide and, finally, the C-terminal part of cc-agglutinin. 

Example 2. Construction of genes encoding a series of homologous chimeric proteins that will be anchored 
in the cell wall of a lower eukaryote and are able to bind with high specificities the musk fragrance trascolide® 
from a complex mixture. 

[0046] The isolation of RNA from the hybridoma cell lines, the preparation of cDNA and amplification of gene frag- 
ments encoding the variable regions of antibodies by PCR was performed according to standard procedures known 
from the literature, see e.g. Orlandi c.s. (1989). For the PCR amplification different oligonucleotide primers have been 
used. 

[0047] For the heavy chain fragment: 

A: AGG TSM AR C TGC AGS AGT CWG G = SEQ ID NO: 6 

Pstl 

in which S is C or G, M is A or C, R is A or G, and W is A or T, 
and 



B: TGA GGA GAC GGT G AC C GT GGT CCC TTG GCC CC 

BstEU = SEQ ID NO: 7. 



For the light chain fragment (Kappa): 

C GAC ATT GAG CTC ACC CAG TCT CCA = SEQ ID NO: 8, 

Sort 

and 



D: GTT TGA TCT CGA G CT TGG TCC C « SEQ ID NO: 9. 

XIiol 

Construction of pUR4143 

[0048] To simplify future construction work an Eagl restriction site was introduced in pUR4122 (see above), at the 
junction between the invertase signal sequence and the scFv-LYS. This was achieved by replacing the about 110 bp 
EcoRI-Psfl fragment within the synthetic fragment given in SEQ ID NO: 1 by synthetic adapters with the following 
sequence: 
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Ecom Pstl 
AATJCGGCCGTTCAGGTGCAGCTGg^ 
GCCGGCAAGTCCACGTCG 



= SEQ ID NO: 10 
« SEQ ID NO: 11. 



5 



[0049] The resulting plasmid was designated pUR41 22.1 : a construction vector for single chain Fv assembly in frame 
behind an Eag\ site for expression behind either the prepro-a-mating factor sequence or the SUC2 invertase signal 
10 sequence. 

[0050] After digesting the heavy chain PCR fragment with Psfl and BstEU, two fragments were obtained: a Psfl 
fragment of about 230 bp and a Psfl/ BstEU fragment of about 110 bp. The latter fragment was cloned into vector 
pUR4122.1, which was digested with Psfl and BstEU. The newly obtained plasmid (pUR4122.2) was digested with 
Sad and Xho\, after which the light chain PCR fragment (digested with the same restriction enzymes) was cloned into 
15 the vector, resulting in pUR41223. This plasmid was digested with Psfl, after which the above described about 230 bp 
Psfl fragment was cloned into the plasmid vector, resulting in a plasmid called pUR4143. Two orientations are possible, 
but selection can be made by restriction analysis, as usual. Instead of the scFv-LYS gene originally present in pUR41 22, 
this new plasmid pUR4143 contains a gene encoding an scFv-TRAS fragment of anti-traseolide antibody 02/01/01 (for 
the nucleotide sequence of the 714 bpPsfl-X/?ol fragment see SEQ ID NO: 12). 



Construction of pUR4178 and pUR4179. 

[0051] After digesting pUR4143 with Eagl and with H/ndlll, an about 71 5 bp fragment can be isolated. Subsequentely, 
this fragment can be cloned into the vector backbone fragments of pUR2741 and pUR4175, that were digested with 
25 the same restriction enzymes. In the case of pUR2741 , this resulted in plasmid pUR2743.4 (see Figure 5). This plasmid 
can subsequently be cleaved with Xho\ and H/ndlll and ligated with the about 8 kbp Xhol-H/ndlll fragment of pUR41 74, 
resulting in pUR4178 (see Figure 6). 

[0052] In the situation where pUR41 75 was used as a starting vector, the resulting plasmid was designated pUR41 79 
(see Figure 7). 

30 Both plasmids, pUR4178 and pUR4179 were introduced into S. cerevisiae. 

Example 3. The modification of the binding parts of the chimeric protein that can bind traseolide® in order to 
improve the binding or release of traseolide® under certain conditions. 

35 [0053] Modification of binding properties of antibodies during the immune response is a well known immunological 
phenomenon originating from the fine tuning of complementarity determining sequences in the antibody's binding region 
to the antigen's molecular properties. This phenomenon can be mimicked in vitro by adjusting the antigen binding 
regions of antibody fragments based on molecular models of these regions in contact with the antigen. 
[0054] One such example consists of protein engineering the antimusk antibody M02/01/01 to a stronger binding 

40 variant M020501L 

[0055] First, a molecular model of M02/01/01 variable fragment (Fv) was constructed by homology modelling, using 
the coordinates of the anti-lysozyme antibody HYHEL-10 as a template (Brookhaven Protein Data Bank entry: 3HFM). 
This mode! was refined using Molecular Mechanics and Molecular Dynamics methods from within the Biosym program 
DISCOVER, on a Silicon Graphics 4D240 workstation. Secondly, the binding site of the resulting Fv was mapped by 
45 visually docking the musk antigen into the CDR region, followed by a refinement using molecular dynamics again. 
Upon inspection of the resulting model for packing efficiency (van der Waals contact areas), it was concluded that 
substitution of ALA H96 by VAL would increase the (hydrophobic) contact area between the ligand and Fv, and con- 
sequently lead to a stronger interaction (see Figure 8). 

[0056] When this mutation is introduced into M02/01/01, the cDNA-derived scFv from Example 2, the result will be 
50 Fv M020501i; a variant with an increased affinity of at least a factor of 5 can be expected, and the increased affinity 
could be measured using fluorescence titration of the Fv with the musk odour molecule. 

Example 4. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of lower 
eukaryote and is able to bind hormones such as HCG. 



[0057] Gene fragments, encoding the variable regions of the heavy and light chain fragments from the monoclonal 
antibody directed against the human chorionic gonadotropin were obtained from a hybridoma cell line in a similar way 
as described in Example 2. 



20 



55 
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[0058] Subsequently, these HCG and gene fragments were cloned into plasmid pUR4143 by replacing the 
corresponding Psfl-BsfEII and Sac\-Xho\ gene fragments, resulting in plasmid pUR4146. 

[0059J Similar to the method described in Example 2, the 734 bp Eag\-Xho\ fragment (nucleotide sequence given in 
SEQ ID NO: 13) encoding the variable regions of the heavy and light chain fragments from the monoclonal antibody 
5 directed against the human chorionic gonadotropin (an scFv-HCG fragment) was isolated from pUR4146 and was 
introduced into the vector backbone fragment of pUR4178 (see Example 2) and will be introduced into the vector 
backbone fragment of pUR4175 (see Example 1), both digested with the same restriction enzymes. The resulting 
plasmids pUR4177 (see Figure 9) was, and pUR4180 (see Figure 10) will be, introduced into S. cerevisiae strain SU10. 

10 Example 5. Construction of a gene encoding a chimeric scFv-FLOl protein that will be anchored in the cell 
wall of lower eukaryote and is able to bind hormones such as HCG. 

[0060] One of the genes associated with the flocculation phenotype in S. cerevisiae is the FL01 gene. The DNA 
sequence of a clone containing major parts of the FL01 gene has been determined, see SEQ ID NO: 14 giving 2685 

15 bp of the FL01 gene. The cloned fragment appeared to be approximately 2 kb shorter than the genomic copy as judged 
from Southern and Northern hybridizations, but encloses both ends of the FL01 gene. Analysis of the DNA sequence 
data indicates that the putative protein contains at the N-terminus a hydrophobic region which confirms a signal se- 
quence for secretion, a hydrophobic C-terminus that might function as a signal for the attachment of a GPI-anchor and 
many glycosylation sites, especially in the C-terminus, with 46.6% serine and threonine in the arbitrarily defined C- 

20 terminus (aa 271-894). Hence, it is likely that the FL01 gene product is located in an orientated fashion in the yeast 
cell wall and may be directly involved in the process of interaction with neighbouring cells. 

[0061] The cloned FL01 sequence might therefore be suitable for the immobilization of proteins or peptides on the 
cell surface by a different type of cell wall anchor. 

[0062] For the production of a chimeric protein comprising the scFv-HCG followed by the C-terminal part of the 
25 FL01 -protein, plasmid pUR2990 (see Figure 1 1 ) can be used as a starting vector. The preparation of episomal plasmid 
pUR2990 was described in our co-pending patent application WO-94/01567 (UNILEVER) published on 20 January 
1994, i.e. during the priority year. Plasmid pUR2990 comprises the chimeric gene consisting of the gene encoding the 
Humicola lipase and a gene encoding the putative C-terminal cell wall anchor domain of the FLOl gene product, the 
chimeric gene being preceded by the invertase signal sequence (SUC2) and the GAL7 promoter; further the plasmid 
30 comprises the yeast 2 um sequence, the defective Leu2 promoter described by Eckard and Hollenberg (1983), and 
the Leu2 gene, see Roy c.s. (1991). Plasmid pUR4146, described in Example 4, can be digested with Psfl and Xho\, 
and the about 0.7 kbp Pst\-Xho\ fragment containing the scFv-HCG coding sequence can be isolated. For the in frame 
fusion of this DNA sequence between the C-terminal FL01 part and the SUC2 signal sequence, the fragment can be 
directly ligated with the 9,3 kbp Eag\fNhe\ (partial) backbone of plasmid pUR2990, resulting in plasmid pUR4196 (see 
35 Figure 12). This plasmid will comprise an additional triplet encoding Ala at the transition between the SUC2 signal 
sequence and the start of the scFv-HCG, and a E-l-K-G-G amino acid sequence in front of the first amino acid (Ser) 
of the C part of FL01 protein. 

[0063] If in the previous Examples 1-5 the level of exposed antibody fragments is too low, the production level can 
he increased by mutagenesis of the frame work regions of the antibody fragment. This can he done in a site directed 
40 way or by (targeted) random mutagenesis, using techniques described in the literature. 

Example 6. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of a lower 
eukaryote and is able to bind cholesterol. 

45 [0064] In the literature two DNA sequences for cholesterol oxidase are described, the choB gene from Brevibacterium 
sterolicum, see Ohta c.s. (1991) and the choA gene from Streptomyces sp. SA-COO, see Ishizaka c.s. (1989). For 
the construction of a DNA fusion between the choS gene coding for cholesterol oxidase (EC 1.1.3.6) and the 3' part 
of the AG-a1 gene, the PCR technique on chromosomal DNA can be applied. Chromosomal DNA can be isolated by 
standard techniques from Brevibacterium sterolicum, and the DNA part coding for the mature part of the cholesterol 

50 oxidase can he amplified through application with the following corresponding PCR primers choOlpcr and cho02pcr: 
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choOlpcr 

5-- ccc ccc agc ccc acc ctc o-3* = SEQ ID NO: 16 

3*- CGC CGG TCG GCG TGG GAG C-5' = SEQ ID NCh 17 

t!l III III III III III I 

S'-ACAT CTCAATTCGCGCCC CC C CCC AGC CGC ACC CTC Q-3' = SEQ ID NO: 18 
CCORX KOtZ 

E&gl 



cho02pcr 

Hhel Hin&lll 

3 • -TAG TAG AGC ACC CTG TAG GTC CCATCG AC TTTCCfrA TCTAGA-S ' = SEQ ID NO: 19 

III III III III III lit III 
5 # -ATC ATC TCG TCC GAC ATC CAG-3 1 = SEQ ID NO: 20 

3* -TAG TAG AGC AGG CTG TAG GTC-5* s SEQ ID NO: 21 



[0065] Both primers can specifically hybridize with the target sequence, thereby amplifying the coding part of the 
gene in such a way, that the specific PCR product -after Proteinase K treatment and digestion with EcoRI and H/'ndlll- 
can be directly cloned into a suitable vector, here preferably pTZ19R, see Mead c.s. (1986). This will result in plasmid 
pUR2985 (see Figure 13). 

[0066] In addition to the already mentioned restriction sites both PCR primers generate other restriction sites at the 
5' end and the 3' end of the 1.5 kbp DNA fragment, which can be used later on to fuse the fragment in frame between 
either the SUC2 signal sequence or the prepro-a-mating factor signal sequence on one side and the C-terminus coding 
part of the a-agglutinin gene on the other side. To facilitate the ligation behind the prepro-MF sequence a Not\ site is 
introduced at the 5* end of PCR oligonucleotide choOlpcr, allowing for example, the exchange of the 731 bp Eag\/Nhe\ 
fragment containing the scFv-Lys coding sequence in pUR4175 for the choB coding sequence. 
[0067] To create an enzymatically inactive fusion protein between cholesterol oxidase and a-agglutinin, the above 
described subcloning into pTZ1 9R can be used. Cholesterol oxidase is an FAD-dependent enzyme for which the crystal 
structure of the Brevibacterum sterolicum enzyme has been determined, see Vrielink c.s. (1991). The enzyme displays 
homology with the typical pattern of the FAD-binding domain with the Gly-X-Gly-X-X-Gly sequence near the N-terminus 
(amino acid 1 8-23). Site-directed in vitro mutagenesis on the plasmid pUR2985 according to the manufacturer's protocol 
(Muta-Gene kit, Bio-Rad) can be applied to inactivate the FAD-binding site through replacing the triplet(s) encoding 
the Gly residue(s) by triplets encoding other amino acids, thereby presumably inactivating the enzyme. E.g. the fol- 
lowing primer can be used for site-directed mutagenesis of 2 of the conserved Gly residues. 



PT 3'- CGG GAG CAG TAG CGG TCA CGT ATG CCG CCA CGG CAG CGG CGC -5* 

iii in iii in i i ill ; : is: in m m in in in 

CB 5*- GCC CTC GTC ATC GGC ACT GGA TAC GGC GGT CCC GTC CCC COO -3' 

Ala ' Gly Gly Gly Gly Ala Ala Ala 

l 4 

Ala Ala 

pr « primer » SEQ ID NO: 22 

cs = coding strand = SEQ ID NO: 23 



[0068] As a result of the mutagenesis with the described primer, plasmid pUR2986 will be obtained. From this plasmid 
the DNA coding for the presumably inactivated cholesterol oxidase can be released as a 1527 bp fragment through 
Not\/Nhe\ digestion, and subsequently directly used to exchange the scFv-Lys coding sequence in pUR4175, thereby 
generating plasmid pUR2987 (see Figure 14). To obtain a variant yeast secretion vector, where the secretion is directed 
through the SUC2 signal sequence, for example the 1823 bp long Sac\/Nhe\ segment of plasmid pUR2986 can be 
used to replace the Sac\/Nhe\ fragment in pUR4174. 

[0069] This inactivation of the FAD-binding site might be preferable over other mutations, since an unchanged active 
centre'can be expected to leave the binding properties of cholesterol oxidase for cholesterol unaltered. Instead of the 
described Gly-Ala exchanges at position 18 and 20 of the mature coding sequence, every other suitable amino acid 
change can also be performed. 

[0070] To inactivate the enzyme, site directed mutagenesis can be optionally immediately performed in the active 
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site cavity, for example through exchange of the Glu331, a residue appropriately positioned to act as the proton ac- 
ceptor, thus generating a new variant of an immobilized, enzymatically inactive fusion protein. 

Example 7. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of a lactic 
5 acid bacterium and is able to bind cholesterol. 

[0071] It has been described that proteinase of Lactococcus lactis subsp. cremoris is anchored to the cell wall through 
its 1 27 amino acid long C-terminal, see Kok c.s. (1988) and Kok (1990). In a way similar to that described in Example 
6, the cholesterol oxidase of Brevibacterium sterolicum (cnoB) can be immobilized on the surface of Lactococcus lactis. 

10 Fusions can be made can be made between the choB structural gene and the N-terminal signal sequence and the C- 
terminal anchor of the proteinase of Lactococcus lactic. Plasmid pGKV550 (see Figure 15) contains the complete 
proteinase operon of Lactococcus lactis subsp. cremoris Wg2, including the promoter, a ribosome binding site and 
DNA fragments encoding the already mentioned signal and anchor sequences, see Kok (1 990). First a DN A fragment, 
containing the main part of the signal sequence, flanked by a C/al site and an Eagl site can be constructed with PGR 

15 on pGKV550 as follows: 



20 



Primer prtl: 

5 1 -AA GAT C TA TCG AT C TTG TTA GCC GGT ACA-3 1 = SEQ ID NO: 24 

Proteinase gene (non coding strand): 

3'-TT CCC GA T AGC TAG AAC AAT CGG CCA TGT CAG-5' 

CJal - SEQ ID NO: 25 



25 



30 



Proteinase gene: Gin Ala Lys 

5 1 -GTC GGC GAA ATC CAA GCA AAG GCG GCT-3 ■ = SEQ ID NO: 26 

Primer prt2: = SEQ ID NO: 27 

V-CAG CCG CTT TAG GTT CGT T GC CGG C CC CCC TTC GAA CCC-5 1 

Eagl Hindlll 



35 [0072} After the PCR reaction as described in Example 6, the 98 bp long PCR fragment can be isolated and digested 
with C/al and Hind\\\. pGKV550 can subsequently be cleaved partially with C/al and completely with HindlU, after which 
digestions the vector fragment, containing the promoter, the ribosome binding site, the DNA fragment encoding the N- 
terminal 8 amino acids and the cell wall binding fragment containing the 127 C-terminal amino acids of the proteinase 
gene can be isolated on gel. 

40 [0073] A copy of the cholesterol oxidase gene, suitable for fusion with the prtP anchor domain can be produced by 
a PCR reaction using plasmid pUR2985 (Example 6) as template and a combination of primer choOlpcr (see Example 
6) and the following primer cho03pcr instead of primer cho02pcr: 



45 cho03pcr Wmdlll 

3 • -TAG TAG AGC AGG CTC TAG GTC CGA G TT CGA AC C TAG GC-5 * = SEQ ID NO: 40 

III ill III III ill III III 
lit iii ill tit ill til ill 

5' -ATC ATC TCG TCC GAC ATC CAG = SEQ ID NO: 20. 



50 

[0074] The about 1 .53 kbp fragment generated by this reaction can be digested with Not\ and H/ndlll to produce a 
molecule which can subsequently be ligated with the large £agl/H//idlll fragment from pUR2988 (see Figure 16). The 
resulting plasmid, pUR2989, will contain the cholesterol oxidase coding sequence inserted between the signal se- 
quence and the C-terminal cell wall anchor domain of the proteinase gene. 
55 [0075] After introduction into Lactobacillus lactis subsp. lactis MG1363 by electroporation, this plasmid will express 
cholesterol oxidase under control of the proteinase promoter. The transport through the membrane will be mediated 
by the proteinase signal sequence and the immobilization of the cholesterol oxidase by the proteinase anchor. As it is 
unlikely that the Lactococcus will secrete FAD as well, the cholesterol oxidase will not be active but will be capable to 
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bind cholesterol. 

Example 8. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of a lower 
eukaryote and is able to bind growth hormones, such as the epidermal growth factor. 

5 

[0076] For the isolation of larger amounts of human epidermal growth factor (EGF) the corresponding receptor can 
be used in form of a fusion between the binding domain and a C-terminal part of ct-agglutinin as cell wall anchor. The 
complete cDNA sequence of the human epidermal growth factor is cloned and sequenced. For the construction of a 
fusion protein with EGF binding capacity the N-terminal part of the mature receptor until the central 23 amino acids 

10 transmenbrane region can be utilized. 

[0077] The plasmid pUR4175 can be used for the construction. Through digestion with Eagl and Nhe\ (partial) a 731 
bp DNA fragment containing the sequence coding for scFv is released and can be replaced by a DNA fragment coding 
for the first 621 amino acids of human epidermal growth factor receptor. Initiating from an existing human cDNA library 
or otherwise through production of a cDNA library by standard techniques from preferentially EGF receptor overex- 

15 pressing cells, e.g. A431 carcinoma cells, see Ullrich c.s. (1984), further PCR can be applied for the generation of in 
frame linkage between the extracellular binding domain of the human growth factor receptor (amino acid 1-622) and 
the C-terminal part of a-agglutinin. 

[0078] PCR oligonucleotides for the in frame linkage of human epidermal growth factor receptor and the C-terminus 
of a-agglutinin. 

20 

a: PCR oligonucleotides for the transition between SUC2 signal sequence and the N-terminus of mature EGF 
receptor. 



> mature EGF receptor 



pri EGF1: 




Ala 


Leu 


GlU 




I*ys 


Lys 


val 


= SEQ ID NO: 28 


5 1 -GGG 


GCG GCC 


GCG 


CTG 


GAG 


GAA 


AAG 


AAA 


GTT 


TGC-3 1 


3 1 -CGC 


Natl 
TCA GCC 


CGA 


i i i 
i i i 
GAC 


» i i 
i i i 
CTC 


i i i 
i i i 
CTT 


1 1 i 
i i i 
TTC 


i t i 
t i i 
TTT 


i i i 
i t i 
CAA 


i i i 
i i i 

ACG 5' 



30 EGF rec (non-coding strand): ■ SEQ ID NO: 29 



b: PCR oligonucleotides for the in frame transition between C terminus of the extracellular binding domain of EGF 
receptor and the C terminal part of a-agglutinin. 

35 

EGF rec (coding strand): 





Asn 


Gly 


Pro 




He 


Pro 


Ser 


Ala Thr 






-AAT 


GGG 


CCT 


AAG 


ATC 


CCG 


TCC 


ATC GCC ACT-3 1 


*= SEQ ID NO: 30 




i i i 
i i i 


i i i 
i i i 


i i i 
» i i 


\ i i 
i i i 


» i i 
i i i 


i i i 

i i i 


i t i 
i i i 




= SEQ ID NO: 31 


3 f 


-TTA 


CCC 


GGA 


TTC 


TAG 


GGC 


AGG 


CGA TCGGAATTCGAA 


CCCC-5 1 



pr EGF2: Nhel Hindlll 

45 [0079] This fusion would result in an addition of 2 Ala amino acids between the signal sequence and the mature N- 
terminus of EGF receptor. 

[0080] The newly obtained 1.9 kbp PCR fragment can be digested with Nott and Nhe\ and directly ligated into the 
vector pUR4 175 after digesting with the same enzymes, resulting in plasmid pUR2993 (see Figure 17), comprising the 
GAL7 promoter, the prepro-a-mating factor sequence, the chimeric EGF receptor binding domain gene / a-agglutinin 
50 gene, the yeast 2 \im sequence, the defective LEU2 promoter and the LEU2 gene. This plasmid can he transformed 
into S. cerevisiae and the transformed cells can be cultivated in YP medium whereby expression of the chimeric protein 
can be induced by adding galactose to the medium. 

Example 9. Construction of genes encoding a chimeric protein anchored to the cell wall of yeast, comprising 
55 a binding domain of a "Camelidae" heavy chain antibody 

[0081] Recently it was described that camels as well as a number of related species (e.g. lamas) contain a consid- 
erable,amount.of IgG antibody molecules which are only composed of heavy-chain dinners, see Hamers-Casterman 
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c.s. (1993). Although these "heavy-chain" antibodies are devoid of light chains, it was demonstrated, that they never- 
theless have an extensive antigen-binding repertoire. In order to show that the variable regions of this type of antibodies 
can be produced and will be linked to the exterior of the cell wall of a yeast, the following constructs were prepared. 

5 Construction of pUR2997, pUR2998 and pUR2999 

[0082] The about 2.1 kbp Eagl-H/ndlll fragment of pUR4177 (Example 4, Fig 9) was isolated. By using PCR tech- 
nology, an EcoRI restriction site was introduced immediately upstream of the Eagl site, whereby the C of the EcoRI 
site is the same as the first C of the Eagl site. The thus obtained EcoRI-H/ndlll fragment was ligated into plasmid 
10 pEMBL9, which was digested with EcoRI and H/ndlll, which resulted in pUR4177.A 

[0083] The EcoRUNhel fragment of plasmid pUR41 77 .A was replaced by the EcoR\INhe\ fragments of three different 
synthetic DNA fragments (SEQ ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 34) resulting in pUR2997, pUR2998 and 
pUR2999, respectively. The about 1.5 kbp 8sO-H/ndlll fragments of pUR2997 and pUR2998 were isolated. 

15 Construction of pUR4421 

[0084] The multiple cloning site of plasmid pEMBL9, see Dente c.s. (1983), (ranging from the EcoRI to the H/ndlll 
site) was replaced by a synthetic DNA fragment having the nucleotide sequence given below, see SEQ ID NO: 35 
giving the coding strand and SEQ ID NO: 36 giving the non-coding strand. The 5-part of this nucleotide sequence 

20 comprises an Eagl site, the first 4 codons of a Camelidae V H gene fragment (nucleotides 16-27) and a Xhol site 
(CTCGAG) coinciding with codons 5 and 6 (nucleotides 28-33). The 3-part comprises the last 5 codons of the Camel- 
idae V H gene (nucleotides 46-60) (part of which coincides with a BsfEII site), eleven codons of the Myc tail (nucleotides 
61-93), see SEQ ID NO: 35 containing these eleven codons and SEQ ID NO: 37 giving the amino acid sequence, and 
an EcoRI site (GAATTC). The EcoRI site, originally present in pEMBL9, is not functional any more, because the 5- 

25 end of the nucleotide sequence contains AATTT instead of AATTC, indicated below as (EcoRI). The resulting plasmid 
is called pUR4421. The Camelidae V H fragment starts with amino acids Q-V-K and ends with amino acids V-S-S. 



(EcoRI) Eagl Xhol BstEII 

30 5 • - AATTT AG CGG CCGCCCAGGT GAAACTCCTC GAfiTAAGTGA CTAA GGTCAC - 50 
3» 1 ATCGCC GGCGGGTCCA CTTTGACGAG CTCATTCACT GATTCCAGTG- 
5 Q V K 

-£GTCTCCTCA GAACAAAAAC TCATCTCAGA AGAGGATCTG AATTAATGAG- 100 

35 -GCAGAGGAGT CTTGTTTTTG AGTAGAGTCT TCTCCTAGAC TTAATTACTC- 

VSS EQK LISE EDL N * * 

= SEQ ID NO: 37 

EcoRI Hindlll 

-M22CATCAA ACGGTGATA -3 » 119 = SEQ ID NO: 35 

40 -TTAAGTAGTT TGCCACTATT CGA -5' 123 ■ SEQ ID NO: 36 



Construction of pUR4424 

45 [0085] After digesting the plasmid pB09 with Xhol and BsfEII, a DNA fragment of about 0.34 kbp was isolated from 
agarose gel. This fragment codes for a truncated V H fragment, missing both the first 4 and the last 5 amino acids of 
the Camelidae V H fragment. Plasmid pB09 was deposited as E. coli JM109 pB09 at the Centraal Bureau voor Schim- 
melcultures, Baarn on 20 April 1 993 with deposition number CBS 271 .93. The DNA and amino acid sequences of the 
Camel V H fragments followed by the Flag sequence as present in plasmid pB09 were given in Figure 6B of European 

50 patent application 93201239.6 (not yet published), which is herein incorporated by reference. The obtained about 
0.34 kbp fragment was cloned into pUR4421. To this end plasmid pUR4421 was digested with Xnol and H/ndlll, after 
which the about 4 kb vector fragment was isolated from an agarose gel. The resulting vector was ligated with the about 
0.34 kbp XhollBstEU fragment and a synthetic DNA linker having the following sequence: 



55 



BstEII Jfijjdill 

GTCACC GTCTCCTCATAATGA, o SEQ ID NO: 38 

GCAGAGGAGTATTACTTCGA » SEQ ID NO: 39 
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resulting in plasmid pUR4421-09. 

[0086] Plasmid pSY16 was digested with Eagl and H/ndlll, after which the about 6.5 kbp long vector backbone was 
isolated and ligated with the about 0.38 kbp Eagl/H/ndlll fragment from pUR4421-09 resulting in pUR4424. 

5 Construction of pUR4482 and pUR4483 

[0087] From pUR4424 the about 0.44 kbp Sacl-BsfEII fragment, coding for the invertase signal sequence and the 
camel heavy chain variable 09 (= CH V 09) fragment, was isolated as well as the about 6.3 kbp Sacl-H/ndlll vector 
fragment. The about 6.3 kbp fragment and the about 0.44 kbp fragment from pUR4424 were ligated with the BstEM-Hin- 

10 dill fragment from pUR2997 or pUR2998 yielding pUR4482 and pUR4483, respectively. 

[0088] Plasmid pUR4482 is thus an yeast episomal expression plasmid for expression of a fusion protein with the 
invertase signal sequence, the CH^ variable region, the Myc-tail and the Camel "X-P-X-P" Hinge region, see Ham- 
ers-Casterman as. (1993), (1993), and the a-agglutinin cell wall anchor region. Plasmid pUR4483 differs from 
pUR4482 in that it contains the Myc-tail hut not the "X-P-X-P" Hinge region. Similarly, the BsfEII-H/ndlll fragment from 

15 pUR2999 can be ligated with the about 6.3 kbp vector fragment and the about 0.44 kbp fragment from pUR4424, 
resulting in pUR4497, which will differ from pUR4482 in that it contains the "X-P-X-P" Hinge region but not the Myc-tail. 
[0089] The plasmids pUR4424, pUR4482 and pUR4483 were introduced into Saccharomyces cerevisiae SU10 by 
electroporation, and transformants were selected on plates lacking leucine. Transformants from SU10 with pUR4424, 
pUR4482 or pUR4483, respectively, were grown on YP with 5% galactose and analysed with immune-fluorescence 

20 microscopy, as described in Example 1 of our co-pending WO-94/01567 (UNILEVER) published on 20 January 1994. 
This method was slightly modified to detect the chimeric proteins, containing both the camel antibody and the Myc tail, 
present at the cell surface. 

[0090] In one method a monoclonal mouse anti-Myc antibody was used as a first antibody to bind to the Myc part of 
the chimeric protein; subsequently a polyclonal antimouse 1g antiserum labeled with fluorescein isothiocyanate (= 
25 FITC) ex Sigma, Product No. F-0527, was used to detect the bound mouse antibody and a positive signal was deter- 
mined by fluorescence microscopy. 

[0091] In the other method a polyclonal rabbit anti-human IgG serum, which had earlier been proven to cross-react 
with the camel antibodies, was used as a first antibody to bind the camel antibody part of the chimeric protein; subse- 
quently a polyclonal anti-rabbit 1g antiserum labeled with FITC ex Sigma, Product No. F-0382, was used to detect the 
30 bound rabbit antibody and a positive signal was determined by fluorescence microscopy. 

[0092] The results in Figure 1 9 and Figure 20 show clearly that fluorescence can be observed on those cells in which 
a fusion protein of the CH V 09 fragment with the a-agglutinin cell wall anchor region is produced (pUR4482 and 
pUR4483). No fluorescence however, was visible on the cells which produce the CH V 09 fragment without this anchor 
(pUR4424), when viewed under the same circumstances. 

35 

Patent literature references: 

[0093] EP-A1 -02551 53 (UNILEVER) Production of guar alpha-galactosidase by hosts transformed by recombinant 
DNA methods. First priority date 03.06.86; published 03.02.88 
40 [0094] WO-9 1/00920 (UNILEVER) Process for preparing a protein by a fungus transformed by multicopy integration 
of an expression vector. First priority date 07.07.89; published 24.01.91 

[0095] WO-91/19782 (UNILEVER) Xylanase production. Priority date 19.06.90; published 26.12.91 
[0096} WO-94/01567 (UNILEVER) Process for immobilizing enzymes to the cell wall of a microbial cell by producing 
a fusion protein. First priority date 08.07.92; published 20.01.94 
45 [0097] EP patent application 93201239.6 (not yet published) Production of antibodies or (functionalized) fragments 
thereof derived from heavy chain immunoglobulins oWamelidae. Filing date 29.04.93 

Non-patent literature references: 

50 [0098] R.E. Bird & B. Webb Walker Single chain antibody variable regions. TIBTECH 9 (April 1 991 ) 1 32-1 37 

A. Conzelmann, C. Fankhauser & C. Desponds Myoinositol gets incorporated into numerous membrane glycopro- 
teins of Saccharomyces cerevisiae; incorporation is dependent on phosphomannomutase {SEC53). The EMBO Journal 
9, No.3 (1990) 653-661 

L. Dente, G. Cesareni & R. Co ties e pEMBL: a new family of single stranded plasmids. Nucleic Acids Research 11 , 
55 No. 6(1983)1645-1655 

E. Erhart, & CP. Hollenberg The Presence of a Defective Leu2 Gene on 2ji DNA Recombinant Plasmids of Saccha- 
romyces cerevisiae Is Responsible for Curing and High Copy Number. Journal of Bacteriology. 156 , No.2 (Nov. 1983) 
625-635 
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C. Hamers-Casterman, T. Atarhouch, S.Muyldermans, G. Robinson, C. Hamers, E. Bajyana Songa, N. Bendah- 
man & R. Hamers Naturally occurring antibodies devoid of light chains. Nature 363 (3 June 1993) 446-448. 

M.M. Harmsen, A.C. Langedijk, E. van Tuinen, R.H. Geerse, H.A. Raue, & J. Maat Effect of pmrl disruption and 
different signal sequences on the intracellular processing and secretion of Cyamopsis tetragonoloba ct-galactosidase 
5 by Saccharomyces cerevisiae. Gene 125(1 993) 115-123 

T. Ishizaki, N. Hirayama, H. Shinkawa, O. Nimi, & Y. Murooka Nucleotide Sequence of the Gene for Cholesterol 
Oxidase from a Streptomyces sp. Journal of Bacteriology 171, No.1 (Jan. 1989) 596-601 

J. Kok, K.J. Leenhouts, A.J. Haandrikman, A.M. Ledeboer & G. Venema Nucleotide Sequence of the Cell Wall 
Proteinase Gene of Streptococcus cremoris Wg2. Applied and Environmental Microbiology 54, No. 1 (Jan. 1988) 
10 231-238 

J. Kok Genetics of the proteolytic system of lactic acid bacteria. FEMS Microbiology Reviews. 87 (1990) 15-42, esp. 
item 3.2 and Figure 7 on pages 22-23 R. Lewin The universal constructor set. New Scientist (8 December 1 990) 30-33 
P.N. Lipke, D. Wojciechowicz & J. Kurjan AGa1 Is the Structural Gene for the Saccharomyces cerevisiae oc-Agglutinin, 
a Cell Surface Glycoprotein Involved in Cell-Cell Interactions during Mating. Molecular and Cellular Biology. 9, No.8 
15 (Aug. 1989)3155-3165. 

D. A. Mead, E. Szczesna-Skorupa & B. Kemper Single-stranded DNA 'blue' T7 promoter plasmids: a versatile tandem 
promoter system for cloning and protein engineering. Protein Engineering^, No. 1 (1986) 67-74 

R. Orlandi, D.H. Giissow, P.T. Jones, & G. Winter Cloning immunoglobulin variable domains for expression by the 
polymerase chain reaction. Proc. Natl. Acad. Sci. USA 86 (May 1989) 3833-3837 
20 B.C. Ossendorp The non-specific lipid-transfer binding protein: a recombinant DNA and immunological approach. 
Thesis University of Utrecht (30 September 1992) T. Ohta, K. Fujishiro, K. Yamaguchi, Y. Tamura, K. Aisaka, T. 
Uwajima & M. Hasegawa Sequence of gene choB encoding cholesterol oxidase of Brevibacterium sterolicum: com- 
parison with choA of Streptomyces sp. SA-COO. Gene 103 (1991 ) 93-96 

A. Roy, C.F. Lu, D.L. Marykwas, P.N. Lipke & J. Kurjan The AGA1 Product Is Involved in Cell Surface Attachment 
25 of the Saccharomyces cerevisiae Cell Adhesion Glycoprotein a-Agglutinin. Molecular and Cellular Biology 11^ No.8 
(Aug. 1991)4196-4206 

M.P. Schreuder, S. Brekelmans, H. van den Ende, & F.M. Klis Targeting of a Heterologous Protein to the Cell Wall 
of Saccharomyces cerevisiae. Yeast 9 (1993) 399-409 

A. Ullrich, L. Coussens, J.S. Hayflick, T.J. Dull, A. Gray, A.W. Tarn, J. Lee, Y. 

30 Yarden, T.A. Libermann, J. Schlessinger, J. Downward, E.L.V. Mayes, N. Whittle, 

M.D. Waterfield & P.H. Seeburg Human epidermal growth factor receptor cDNA sequence and aberrant expression 
of the amplified gene in A431 epidermoid carcinoma cells. Nature 309 (31 May 1984) 418^425 
J.M.A. Verbakel Heterologous gene expression in the yeast Saccharomyces cerevisiae. Thesis University of Utrecht 
(1 May 1991), esp. pages 76-89 

35 A. Vrielink, L.F. Lloyd, & D.M. Blow Crystal Structure of Cholesterol Oxidase from Brevibacterium sterolicum Refined 
at 1.8 A Resolution. J. Mol. BioL, 219 (1991 ) 533-554 

E. S. Ward, D. Gussow, A.D. Griffiths, P.T. Jones & G. Winter. Binding activities of a repertoire of single immu- 
noglobulin variable domains secreted from E. coli. Nature 3fM (12 October 1989) 544-546 

[0099] Information on a deposit of a micro-organism under the Budapest Treaty is given on page 26, lines 5-7 above . 
40 in agreement with Rule 28 (4) EPC, or a similar arrangement for a State not being a Contracting State of the EPC, it 
is hereby requested that a sample of such deposit, when requested, will be submitted to an expert only. 

SEQUENCE LISTING 

45 [0100] 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

50 

(A) NAME: Unilever N.V. 

(B) STREET: Weena 455 

(C) CITY: Rotterdam 

(E) COUNTRY: The Netherlands 
55 (F) POSTAL CODE (ZIP): NL-3013 AL 

(A) NAME: Unilever PLC 

(B) STREET: Unilever House Blackfriars 



16 



EP 0 682 710 B1 



(C) CITY: London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): EC4P 4BQ 

(A) NAME: Leon Gerardus J. FRENKEN 

(B) STREET: Geldersestraat 90 

(C) CITY: Rotterdam 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP): NL-3011 MP 

(A) NAME: Pieter DE GEUS 

(B) STREET: Boeier 24 

(C) CITY: Barendrecht 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP): NL-2991 KB 

(A) NAME: Franciscus Maria KLIS 

(B) STREET: Benedenlangs 102 

(C) CITY: Amsterdam 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP): NL-1025 KL 

(A) NAME: Holger York TOSCHKA; c/o Langnese Iglo, BR3 

(B) STREET: Aeckern 1 

(C) CITY: REKEN 

(E) COUNTRY: Germany 

(F) POSTAL CODE (ZIP): D-48734 

(A) NAME: cornelis Theodorus VERRIPS 

(B) STREET: Hagedoorn 18 

(C) CITY: Maassluis 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP): NL-3142 KB 

- (ii) TITLE OF INVENTION: Immobilized proteins with specific binding capacities and their use in processes 
and products. 

(Hi) NUMBER OF SEQUENCES: 40 
(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1 .0, Version #1 .25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 
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(B) CLONE: fragment in pUR4119 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

5 



10 



GAATTCGACC 


TCATCACACA 


AACAAACAAA 


ACAAAATGAt GCTTTTCCAA GCCTTTCTTT 


60 


TCCTTTTGGC TGGTTTTGCA 


GCCAAAATAT 


CTGCGCAGGT GCAGCTGCAG TAATGAACCA 


120 


CGGTCACCGT 


CTCCTCAGGT 


CGACGCGGTT 


CAGGCGGAGG TGGCTCTGGC GGTGGCGGAT 


180 


CGGACATCGA 


CCTCACTCAG 


ACCAAGCTCG 


AGATCAAACG GTGATAAGCT T 


231 



(2) INFORMATION FOR SEQ ID NO: 2: 

15 

, (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

25 (vii) IMMEDIATE SOURCE: 

(B) CLONE: linker Xhol-Nhel coding strand 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

30 

TCCACATCAA AOG CCCATCT O 21 

(2) INFORMATION FOR SEQ ID NO: 3: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
45 (vii) IMMEDIATE SOURCE: 

(B) CLONE: linker Xhol-Nhel non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

50 

CTAGCACATC CCC CTT TOAT C 21 

(2) INFORMATION FOR SEQ ID NO: 4: 

55 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: linker Eagl-Pstl coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGCCGCCCAG GTGCAGCTGC A 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: linker Eagl-Pstl non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCTGCACCTG GGC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: PGR primer A (heavy chain) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACGTSMARCT GCAGSAGTCW GG 

(2) INFORMATION FOR SEQ ID NO: 7: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: PCR primer B (heavy chain) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TGACGAGACG CTCACCGTGG TCCCTTCOCC CC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: PCR primer C (light chain) 
' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GACATTGAGC TCACCCAGTC TCCA 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B). CLONE: PCR primer D (light chain) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTTTCATCTC CAGCTTGCTC CC 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: linker EcoRI-Patl coding strand 
' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AATTCCGCCG TTCACCTCCA CCTCCA 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: linker EcoRI-Pstl non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CCTCCACCTG AACCCCCC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: ScFv antitraseolide 02/01/01 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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CTGCAGGAGT 


CTGGACCTGG 


CCTGGTGAAA 


CCTTCTCAGT 


CTCTGTCCCT 


CACCTGCACT 


60 




GTCACTGGCT 


ACTCAATCAC 


CAGTGATTTT 


GCCTGGAACT 


GGATCCGGCA 


GTTTCCAGGA 


120 


c 
0 


AACCAACTCG 


AGTGGATGGG 


CTACATAAGC 


TACAGTGGTA 


GCACTAGCTA 


CAACCCATCT 


180 




CTCAAAAGTC 


GAATCTCTCT 


CACTCGAGAC 


ACATCCAAGA 


ACCAGTTCTT 


CCTGCAGTTG 


240 




AATTCTGTGA 


CTACTGAGGA 


CACAGCCACA 


TATTACTGTG 


CAACGTCCCT 


AACATGCTTA 


300 


w 


CTACGTCGGA 


AACGTTCTTA 


CTGGGGCCAA 


GGGACCACGG 


TCACCGTCTC 


CTCAGGTGGA 


360 




GGCGGTTCAG 


GCGGAGGTGG 


CTCTGGCGGT 


GGCGGATCGG 


ACATCGAGCT 


CACCCAGTCT 


420 




CCATCCTCCA 


TGTCTCTATC 


TCTGCGAGAC 


ACAGTCAGCA 


TCACTTGCCA 


TCCAAGTCAG 


480 


15 


GACATTAGCA 


GTAATATAGG 


CTGGTTCCAG 


CAGAAACCAG 


GGAAATCATT 


TAAGGGCCTG 


S40 




ATCTATCATG 


GAACCAACTT 


GGAAGATCCT 


ATTCCATCAA 


GGTTCAGTGG 


CAGTGGATCT 


600 




GGAGCAGATT 


ATTCCCTCAC 


CATCAGCAGC 


CTGGAATCTG 


AAGATTTTCC 


AGACTATTAC 


660 


20 


TGTGTACAGT 


ATGCTCAGTT 


TCCATTCAOC 


TTCGGCTCGG 


GGACCAAGCT 


CGA0 


714 



(2) INFORMATION FOR SEQ ID NO: 13: 
25 • (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 (0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

35 

(B) CLONE: ScFvanti-HCG 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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CCCCCCTTCA CCTCCACCTG CAGCAGTCTG GCCGACACTT ACTGAAGCCT GCAGGGTCCC 60 

TGAAACTCTC CTGTGCAGCC TCTGGATTCG CTTTCAGTAC CTTTGACATG TCTTGGATTC 120 

5 GCCAGACTCC GGAGAAGAGG CTGGAGTGGG TCGCAAGCAT TACTAATGTT GGTACTTACA 160 

CCTACTATCC AGGCAGTGTG AAGGGCCGAT TCTCCATCTC CAGACACAAT GCCAGGAACA 240 

CCCTAAACCT GCAAATGAGC AGTCTGAGGT CTGAGGAGAC GCCCTTGTAT TTCTGTGCAA 300 

10 GACAGGGGAC TGCGGCACAA CCTTACTGGT ACTTCGATGT CTGGGGCCAA GGGACCACCG 360 

TCACCGTCTC CTCAGGTCCA GGCGGTTCAG GCGGAGGTGG CTCTCGCGGT GCCGGATOCG 420 

ACATCGAGCT CACCCAGTCT CCAAAATCCA TGTCCATGTC CGTAGCAGAG AGGGTCACCT 460 

15 TGAGCTGCAA GGCGAGTGAG ACTGTGGATT CTTTTGTGTC CTGGTATCAA CAGAAACCAG 540 

AACAGTCTCC TAAATTGTTG ATATTCGGGG CATCCAACCG GTTCAGTGGG GTCCCCGATC 600 

GCTTCACTGG CAGTGGATCT GCAACAGACT TCACTCTGAC CATCAGCAGT GTGCAGGCTG 660 

20 AGGACTTTCC CGATTACCAC TGTGGACAGA CTTACAATCA TCCGTATACG TTCGGAGGGG 720 

GGACCAAGCT CGAG 734 

25 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 2685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

35 (jj) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

40 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pYY105 
45 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2685 

(D) OTHER INFORMATION: /product= "Flocculation protein" /gene= "FL01" 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



55 
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ATG ACA ATC CCT CAT CCC TAT ATG TTT TTG CCA GTC TTT ACA CTT CTG 48 
Met Thr Met Pro His Arg Tyr Met Phe Leu Ala Val Phe Thr Leu Leu 
I 5 10 15 

5 CCA CTA ACT AGT GTG CCC TCA GGA GCC ACA GAG GCG TGC TTA CCA CCA 96 

Ala Leu Thr Ser Val Ala Ser Gly Ala Thr Glu Ala Cyn Leu pro Ala 
20 25 30 

GGC CAG AGG AAA ACT GGG ATG AAT ATA AAT TTT TAC CAG TAT TCA TTG 144 

Gly Gin Arg Lye Ser Gly Met Aen Zle Ann Phe Tyr Gin Tyr Ser Leu 

10 35 40 45 

AAA GAT TCC TCC ACA TAT TCG AAT CCA CCA TAT ATG CCT TAT GGA TAT 192 
Lys Asp Ser Ser Thr Tyr Ser Aen Ala Ala Tyr Met Ala Tyr Gly Tyr 
50 55 60 

15 CCC TCA AAA ACC AAA CTA CCT TCT GTC GGA GGA CAA ACT GAT ATC TCG 240 

Ala Ser Lys Thr Lys Leu Gly Ser Val Gly Gly Gin Thr Asp lie Ser 
65 70 75 80 

ATT GAT TAT AAT ATT CCC TGT CTT ACT TCA TCA GGC ACA TTT CCT TGT 288 
lie Asp Tyr Aen Zle Pro eye Val Ser Ser Ser Gly Thr Phe Pro Cys 
20 65 90 95 

CCT CAA GAA GAT TCC TAT GGA AAC TGG GGA TCC AAA GGA ATG GGT CCT 336 
Pro Gin Glu Asp Ser Tyr Gly Asn Trp Gly Cys Lys Gly Met Gly Ala 
100 105 110 

25 TGT TCT AAT ACT CAA GGA ATT CCA TAC TGG AGT ACT GAT TTA TTT GGT 384 

Cys Ser Asn Ser Gin Gly lie Ala Tyr Trp Ser Thr Asp Leu Phe Gly 
115 120 125 

TTC TAT ACT ACC CCA ACA AAC CTA ACC CTA GAA ATC ACA GGT TAT TTT 432 
Phe Tyr Thr Thr Pro Thr Asn Val Thr Leu Glu Met Thr Gly Tyr Phe 
30 130 135 140 

TTA CCA CCA CAG ACC GGT TCT TAC ACA TTC AAG TTT CCT ACA GTT GAC 480 

Leu Pro Pro Gin Thr Gly Ser Tyr Thr Phe Lys Phe Ala Thr val Asp 
145 150 155 160 

35 GAC TCT GCA ATT CTA TCA CTA GGT GGT CCA ACC GCG TTC AAC TGT TGT 528 

Asp Ser Ala He Leu Ser val Gly Gly Ala Thr Ala Phe Asn Cys Cys 
165 170 175 



40 



45 



50 



55 
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GCT CAA CAG CAA CCG CCG ATC ACA TCA ACG AAC TTT ACC ATT GAC CGT 576 

Ala Gin Gin Gin Pro Pro lie Thr ser Thr Asn Phe Thr He Asp Gly 
180 185 190 

5 ATC AAG CCA TGG GCT GGA AGT TTG CCA CCT AAT ATC CAA GGA ACC GTC 624 

He Lys Pro Trp Gly Gly Ser Leu Pro Pro Asn He Glu Gly Thr Val 
195 200 205 

TAT ATG TAC GCT CCC TAC TAT TAT CCA ATG AAG GTT GTT TAC TCG AAC 672 

Tyr Ket Tyr Ala Gly Tyr Tyr Tyr Pro Met Lys Val Val Tyr Ser Asn 
10 210 215 220 

GCT GTT TCT TGG CGT ACA CTT CCA ATT AGT GTG ACA CTT CCA GAT GGT 720 
Ala Val ser Trp Gly Thr Leu Pro He Ser val Thr Leu Pro Asp Gly 
225 230 235 240 

15 ACC ACT GTA AGT GAT GAC TTC GAA GGG TAC GTC TAT TCC TTT GAC GAT 768 

Thr Thr Val Ser Asp Asp Phe Glu Gly Tyr Val Tyr Ser Phe Asp Asp 
245 250 255 

GAC CTA AGT CAA TCT AAC TGT ACT GTC CCT GAC CCT TCA AAT TAT GCT 816 
A Bp Leu Ser Gin Ser Asn Cys Thr Val Pro Asp Pro Ser Asn Tyr Ala 
20 260 265 270 

GTC AGT ACC ACT ACA ACT ACA ACG GAA CCA TGG ACC GGT ACT TTC ACT 864 
Val Ser Thr Thr Thr Thr Thr Thr Glu Pro Trp Thr Gly Thr *he Thr 
275 260 265 

TCT ACA TCT ACT GAA ATG ACC ACC GTC ACC GGT ACC AAC GGC GTT OCA 912 

25 Ser Thr Ser Thr Glu Met Thr Thr Val Thr Gly Thr Asn Gly Val Pro 

290 295 300 

ACT GAC GAA ACC GTC ATT GTC ATC AGA ACT CCA ACC AGT GAA GGT CTA 960 

Thr Asp Glu Thr Val lie Val He Arg Thr Pro Thr Ser Glu Gly Leu 

305 310 315 320 

30 

ATC AGC ACC ACC ACT GAA CCA TGG ACT GGC ACT TTC ACT TCG ACT TCC 1008 
lie ser Thr Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser 
325 330 335 

ACT GAG GTT ACC ACC ATC ACT GGA ACC AAC GGT CAA CCA ACT GAC GAA 1056 
35 Thr Glu val Thr Thr He Thr Gly Thr Asn Gly Gin Pro Thr Asp Glu 

340 345 350 

ACT GTG ATT GTT ATC AGA ACT CCA ACC AGT GAA GGT CTA ATC AGC ACC 1104 
Thr Val He Val He Arg Thr Pro Thr Ser Glu Gly Leu He Ser Thr 
355 360 365 

40 ACC ACT GAA CCA TGG ACT GGT ACT TTC ACT TCT ACA TCT ACT GAA ATG 1152 

Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser Thr Glu Met 
370 375 380 

ACC ACC GTC ACC GGT ACT AAC GGT CAA CCA ACT GAC GAA ACC GTG ATT 1200 

Thr Thr Val Thr Gly Thr Asn Gly Gin Pro Thr Asp Glu Thr Val He 
45 385 390 395 400 

CTT ATC AGA ACT CCA ACC AGT GAA CGT TTG GTT ACA ACC ACC ACT GAA 1248 
Val He Arg Thr Pro Thr Ser Glu Gly Leu Val Thr Thr Thr Thr Glu 
405 410 415 

50 CCA TGG ACT GCT ACT TTT ACT TCG ACT TCC ACT GAA ATG TCT ACT GTC 1296 

Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser Thr Glu Met Ser Thr val 
420 425 430 

ACT GGA ACC AAT GGC TTG CCA ACT GAT GAA ACT GTC ATT GTT GTC AAA 1344 
Thr Gly Thr Asn Gly Leu Pro Thr Asp Glu Thr Val He Val Val Lys 
55 435 440 445 
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ACT CCA ACT ACT GCC ATC TCA TCC ACT TTG TCA TCA TCA TCT TCA GGA 1392 
Thr Pro Thr Thr Ala He Ser Ser Ser Leu Ser Ser Ser Ser Sex- Gly 
450 455 460 

5 CAA ATC ACC AGC TCT ATC ACC TCT TCC CGT CCA ATT ATT ACC CCA TTC 1440 

Gin He Thr Ser Ser He Thr Ser Ser Arg Pro He He Thr Pro Phe 
465 470 475 460 

TAT CCT AGC AAT GGA ACT TCT GTG ATT TCT TCC TCA GTA ATT TCT TCC 1486 
Tyr Pro Ser Asn Gly Thr Ser Val He Ser Ser Ser Val He Ser Ser 
10 465 490 495 

TCA GTC ACT TCT TCT CTA TTC ACT TCT TCT CCA GTC ATT TCT TCC TCA 1536 
Ser Val Thr Ser Ser Leu Phe Thr Ser Ser Pro Val He Ser Ser Ser 
500 505 510 

15 GTC ATT TCT TCT TCT ACA ACA ACC TCC ACT TCT ATA TTT TCT CAA TCA 1584 

Val He Ser ser Ser Thr Thr Thr ser Thr ser He Phe Ser Glu Ser 
515 520 525 

TCT AAA TCA TCC GTC ATT CCA ACC ACT ACT TCC ACC TCT GGT TCT TCT 1632 

Ser Lys Ser Ser Val He Pro Thr Ser Ser Ser Thr Ser Gly Ser ser 
20 530 535 540 

GAG AGC GAA ACG ACT TCA CCT GGT TCT GTC TCT TCT TCC TCT TTT ATC 1680 
Glu Ser Glu Thr Ser Ser Ala Gly Ser Val ser ser Ser ser Phe He 
545 550 555 560 

25 TCT TCT GAA TCA TCA AAA TCT CCT ACA TAT TCT TCT TCA TCA TTA CCA 1726 

Ser Ser Glu Ser Ser Lye Ser Pro Thr Tyr Ser Ser Ser Ser Leu Pro 
565 570 575 

CTT GTT ACC AGT GCG ACA ACA AGC CAG GAA ACT GCT TCT TCA TTA CCA 1776 
Leu Val Thr Ser Ala Thr Thr Ser Gin Glu Thr Ala Ser Ser Leu Pro 
580 585 590 

30 ■ 

CCT GCT ACC ACT ACA AAA ACG AGC GAA CAA ACC ACT TTG GTT ACC GTG 1824 

Pro Ala Thr Thr Thr Lys Thr Ser Glu Gin Thr Thr Leu Val Thr Val 

595 600 605 

ACA TCC TGC GAG TCT CAT GTG TCC ACT GAA TCC ATC TCC CCT GCG ATT 1872 
35 Thr ser Cys Glu ser Hie Val Cys Thr Glu Ser He Ser Pro Ala He. 

610 615 620 

GTT TCC ACA GCT ACT GTT ACT GTT AGC GGC GTC ACA ACA GAG TAT ACC 1920 
Val Ser Thr Ala Thr Val Thr Val Ser Gly Val Thr Thr Glu Tyr Thr 
62S 630 635 640 

40 

ACA TGG TGC CCT ATT TCT ACT ACA GAG ACA ACA AAG CAA ACC AAA GGG 1968 
Thr Trp Cys Pro He Ser Thr Thr Glu Thr Thr Lya Gin Thr Lye Gly 
645 650 655 

ACA ACA GAG CAA ACC ACA GAA ACA ACA AAA CAA ACC ACG GTA GTT ACA 2016 
Thr Thr Glu Gin Thr Thr Glu Thr Thr Lya Gin Thr Thr Val Val Thr 
45 660 66S 670 

ATT TCT TCT TGT GAA TCT GAC -GTA TGC TCT AAG ACT GCT TCT CCA GCC 2064 
He Ser ser Cys Glu Ser Asp Val Cys Ser Lys Thr Ala Ser Pro Ala 
675 680 685 

50 ATT CTA TCT ACA ACC ACT CCT ACT ATT AAC GCC GTT ACT ACA GAA TAC 2112 

He Val Ser Thr Ser Thr Ala Thr He Asn Gly Val Thr Thr Glu Tyr 

690 695 700 

ACA ACA TGG TGT CCT ATT TCC ACC ACA GAA TCG AGG CAA CAA ACA ACG 2160 
Thr Thr Trp Cys Pro He Ser Thr Thr Glu Ser Arg Gin Gin Thr Thr 
55 70S 710 715 720 
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CTA GTT ACT GTT ACT TCC TCC GAA TCT GGT GTG TGT TCC GAA ACT GCT 

Leu Val Thr Val Thr ser Cys Glu Ser Gly Val Cys Ser Glu Thr Ala 
725 730 735 

TCA CCT GCC ATT GTT TCG ACG GCC ACG GCT ACT GTG AAT GAT GTT GTT 
Ser Pro Ala He Val Ser Thr Ala Thr Ala Thr Val Asn Asp Val Val 
740 745 750 

ACG GTC TAT CCT ACA TGG AGG CCA CAG ACT GCG AAT GAA GAG TCT GTC 
Thr Val Tyr Pro Thr Trp Arg Pro Gin Thr Ala Asn Glu Glu Ser Val 
755 760 765 

AGC TCT AAA ATG AAC AGT GCT ACC GGT GAG ACA ACA ACC AAT ACT TTA 
Ser Ser Lys Met Asn Ser Ala Thr Gly Glu Thr Thr Thr Asn Thr Leu 
770 775 780 

GCT GCT GAA ACG ACT ACC AAT ACT GTA GCT GCT GAG ACG ATT ACC AAT 
Ala Ala Glu Thr Thr Thr Asn Thr Val Ala Ala Glu Thr He Thr Asn 
785 790 795 800 

ACT GGA GCT GCT GAG ACG AAA ACA GTA GTC ACC TCT TCG CTT TCA AGA 

Thr Gly Ala Ala Glu Thr Lys Thr Val Val Thr Ser Ser Leu Ser Arg 

805 810 815 

TCT AAT CAC GCT GAA ACA CAG ACG GCT TCC GCG ACC GAT GTG ATT GGT 
Ser Asn His Ala Glu Thr Gin Thr Ala Ser Ala Thr Asp Val He Gly 
820 625 830 

CAC AGC AGT AGT GTT GTT TCT CTA TCC GAA ACT GGC AAC ACC AAG AGT 
His Ser Ser Ser Val Val Ser Val Ser Glu Thr Gly Asn Thr Lys Ser 
835 640 84S 

CTA ACA AGT TCC GGG TTG AGT ACT ATG TCG CAA CAG CCT CCT AGC ACA 
Leu Thr Ser Ser Gly Leu Ser Thr Met Ser Gin Gin Pro Arg ser Thr 
850 855 860 

CCA GGA AGC AGC ATG GTA GGA TAT AGT ACA GCT TCT TTA GAA ATT TCA 

Pro Ala Ser Ser Met Val Gly Tyr Ser Thr Ala Ser Leu Glu He Ser 
865 870 875 880 

ACG TAT GCT GGC AGT GCA ACA GCT TAC TGG CCG GTA GTG GTT TAA 
Thr Tyr Ala Gly Ser Ala Thr Ala Tyr Trp Pro Val Val Val 

885 890 895 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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Met Thr Met Pro Bis Arg Tyr Met Phe Leu Ala Val Phe Thr Leu Leu 
15 10 15 

Ma Leu Thr Ser Val Ala Ser Gly Ala Thr Glu Ala Cys Leu Pro Ala 

20 25 30 

Gly Gin Arg Lys Ser Gly Met Asn lie Asn Phe Tyr Gin Tyr Ser Leu 
35 40 45 

Lys Asp Ser Ser Thr Tyr Ser Asn Ala Ala Tyr Met Ala Tyr Gly Tyr 
50 55 60 
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Ala Ser Lys Thr Lys Leu Gly Ser Val Gly Cly Gin Thr Asp He Ser 
65 70 75 80 



10 



He Asp Tyr Asn He Pro Cys Val Ser Ser Ser Gly Thr Phe Pro Cys 
85 90 95 

Pro Gin Glu Asp ser Tyr Gly Asn Trp Gly Cys Lys Gly Het Gly Ala 
. 100 105 110 

cys Ser Asn Ser Gin Cly He Ala Tyr Trp Ser Thr Asp Leu Phe Gly 
115 120 125 



Phe Tyr Thr Thr Pro Thr Asn Val Thr Leu Glu Met Thr Gly Tyr Phe 
130 135 140 



15 



Leu Pro Pro Gin Thr Gly Ser Tyr Thr Phe Lys Phe Ala Thr Val Asp 
145 150 155 160 



20 



Asp Ser Ala He Leu Ser Val Gly Gly Ala Thr Ala Phe Asn Cys Cys 
165 170 175 

Ala Gin Gin Gin Pro Pro He Thr Ser Thr Asn Phe Thr He Asp Gly 
180 185 190 



He Lys Pro Trp Gly Gly Ser Leu Pro Pro Asn He Glu Gly Thr Val 

195 200 205 
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Tyr Met Tyr Ala Gly Tyr Tyr Tyr Pro Met Lys Val Val Tyr Ser Asn 
210 215 220 
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Ala Val Ser Trp Gly Thr Leu Pro He Ser Val Thr Leu Pro Asp Gly 
225 230 235 240 

Thr Thr Val Ser Asp Asp Phe Glu Gly Tyr Val Tyr Ser Phe Asp Asp 
245 250 255 

Asp Leu Ser Gin Ser Asn Cys Thr Val Pro Asp Pro Ser Asn Tyr Ala 
260 265 270 

Val Ser Thr Thr Thr Thr Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr 
275 280 285 

Ser Thr ser Thr Glu Met Thr Thr Val Thr Gly Thr Asn Gly Val Pro 
290 295 300 



40 



Thr Asp Glu Thr Val He Val He Arg Thr Pro Thr ser Glu Gly Leu 
305 310 315 320 

He Ser Thr Thr Thr Glu Pro Trp Thr Gly Thr Phe Tor Ser Thr Ser 

325 330 335 



45 



Thr Glu Val Thr Thr He Thr Gly Thr Asn Gly Gin Pro Thr Asp Glu 
340 345 350 

Thr Val He Val He Arg Thr Pro Thr Ser Glu Gly Leu He Ser Thr 
355 360 365 



50 



Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser Thr Glu Met 
370 375 3SO 

Thr Thr Val Thr Gly Thr Asn Gly Gin Pro Thr Asp Glu Thr Val He 
365 390 395 400 

Val He Arg Thr Pro Thr Ser Glu Gly Leu Val Thr Thr Thr Thr Glu 
405 410 415 



55 
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Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser Thr Clu Met Ser Thr Val 
420 425 430 

Thr Cly Thr Asn Cly Leu Pro Thr Asp Glu Thr Val lie Val Val Lye 
435 440 445 

Thr pro Thr Thr Ala lie Ser Ser Ser Leu Ser Ser Ser Ser Ser Cly 
450 455 460 

Gin lie Thr Ser Ser lie Thr Ser Ser Arg Pro He He Thr Pro Phe 
465 470 475 480 

Tyr Pro Ser Asn Gly Thr Ser Val He Ser Ser Ser Val He Ser Ser 
485 490 495 

Ser Val Thr Ser Ser Leu Phe Thr Ser Ser Pro Val lie Ser Ser Ser 
500 505 510 

Val He Ser Ser Ser Thr Thr Thr Ser Thr Ser He Phe Ser Glu Ser 

515 520 525 

Ser Lye Ser Ser Val He Pro Thr Ser Ser Ser Thr Ser Gly Ser Ser 
530 535 540 

Glu Ser Glu Thr Ser Ser Ala Gly Ser Val ser Ser Ser ser Phe He 
545 550 555 560 

Ser Ser Glu Ser Ser Lys Ser Pro Thr Tyr Ser Ser Ser Ser Leu Pro 
565 570 575 

Leu Val Thr Ser Ala Thr Thr ser Gin Glu Thr Ala Ser Ser Leu Pro 
580 585 590 

Pro Ala Thr Thr Thr Lye Thr Ser Glu Gin Thr Thr Leu Val Thr Val 

595 600 605 

Thr Ser Cye Glu Ser His Val Cys Thr Glu Ser lie Ser Pro Ala lie 
610 615 620 

Val Ser Thr Ala Thr Val Thr Val ser Gly Val Thr Thr Glu Tyr Thr 
625 630 635 640 

Thr Trp Cys Pro He Ser Thr Thr Glu Thr Thr Lys Gin Thr Lys Gly 
645 650 655 

Thr Thr Clu Gin Thr Thr Clu Thr Thr Lye Gin Thr Thr Val Val Thr 
660 665 670 

He Ser Ser Cye Glu Ser Asp Val Cys Ser Lys Thr Ala Ser Pro Ala 
675 680 685 

He Val Ser Thr Ser Thr Ala Thr He Aan Gly Val Thr Thr Glu Tyr 
690 695 700 

Thr Thr Trp Cys Pro He ser Thr Thr Glu Ser Arg Gin Gin Thr Thr 
705 710 715 720 

Leu Val Thr Val Thr Ser Cys Glu Ser Gly Val Cys Ser Glu Thr Ala 

725 730 735 

Ser Pro Ala He Val Ser Thr Ala Thr Ala Thr Val Asn Asp Val Val 
740 745 750 

Thr Val Tyr Pro Thr Trp Arg Pro cln Thr Ala Asn Glu Glu ser Val 
755 760 765 
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15 



20 



25 



ser Ser Lys Met Asn Ser Ala Thr cly Glu Thr Thr Thr Aen Thr Leu 
770 775 780 

Ala Ala Glu Thr Thr Thr Asn Thr Val Ala Ala Glu Thr lie Thr Asn 
785 790 795 800 

Thr Gly Ala Ala Glu Thr Lys Thr Val Val Thr Ser Ser Leu Ser Arg 

805 810 815 

Ser Asn His Ala Glu Thr Gin Thr Ala Ser Ala Thr Asp Val lie Gly 
820 825 830 

His Ser Ser Ser Val Val Ser Val Ser Glu Thr Gly Asn Thr Lys Ser 
835 840 845 

Leu Thr Ser Ser Gly Leu Ser Thr Met Ser Gin Gin Pro Arg Ser Thr 
850 855 860 

Pro Ala Ser Ser Met Val Gly Tyr Ser Thr Ala Ser Leu Glu Xle Ser 
865 870 875 880 

Thr Tyr Ala Gly Ser Ala Thr Ala Tyr Trp Pro Val val Val 
885 890 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES DNA (genomic) 

35 (vii) IMMEDIATE SOURCE: 

(B) CLONE: ChoB template coding strand 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

40 

GCCCCCAGCC GCACCCTCG 19 

(2) INFORMATION FOR SEQ ID NO: 17: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
55 (vii) IMMEDIATE SOURCE: 

(B) CLONE: ChoB template non-coding strand 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGAGGCTGCG GCTCGGGGC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: choOlpcr primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AGATCTGAAT TCGCGGCCCC CCCCAGCCGC RCC C TCG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
' (vii) IMMEDIATE SOURCE: 

(B) CLONE: cho02pcr primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AGATCTAAGC TTTCAGCTAQ CCTGGATOTC GGACG AGATG AT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 
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(B) CLONE: ChoB template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATCATCTCCT CCGACATCCA C 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: ChoB template non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CTGGATGTCG GACGAGATCA T 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: mutagenesis primer ChoB 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CCCGGCCACC GCACCGCCCT ATGCACTGGC CATGACGAGO CC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: ChoB template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CCCCTOGTCA TCGCCAGTCG ATACCCCGGT CCCGTCGCCG CO 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer prtl 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AAGATCTATC GATCTTGTTA CCCGGTACA 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: proteinase template non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GACTGTACCG GCTAACAAGA TCGATAGCCC TT 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: proteinase template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GTCGCCCAAA TCCAAGCAAA GGCCGCT 

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 
(B) CLONE: prt2 primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CCCAAGCTTC CCCCCGGCCC TTGCTTCGAT TTCGCOGAC 

(2) INFORNATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

< (ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: EGF1 primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GGCGCGGCCO CGCTGGAGOA AAACAAACTT TGC 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: EGF receptor template non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GCAAACTTTC TTTTCCTCCA GAGCCCGACT CCC 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: EGF receptor template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

AATGGGCCTA AGATCCCGTC CATCGCCACT 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: EGF2 primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CCCCAAGCTT AACGCTAGOO GACCCGATCT TAGCCCCATT 

(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 177 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 
10 (B) CLONE: VhC - AGa1 linker with MycT and Hinge 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



15 GAATTCCAGG TCACCGTCTC CTCAGAACAA AAACTCATCT CAGAAGAGGA TCTGAATGAA 60 

CCAAAGATTC CACAACCTCA ACCAAACCCA CAACCTCAAC CACAACCACA ACCAAAAOCT 120 
CAACCAAAGC CAGAACCAGA ATCTACTTCC CCAAAGTCTC CAGCT AG CCT TAAGCTT 177 

20 (2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: VhC - AGotl linker with MycT 

35 , (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GAATTCCAGG TCACCGTCTC CTCAGAACAA AAACTCATCT CAGAAGACGA TCTGAATGCT 60 
AGC 63 

40 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: VhC - AGotl linker with Hinge 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
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GAATTCCAGG TCACCGTCTC CTCACAACCA AAGATTCCAC AACCTCAACC AAAGCCACAA 60 

CCTCAACCAC AACCACAACC AAAACCTCAA CCAAAGCCAG AACCAGAATC TACTTCCCCA 120 

5 AAGTCTCCAO CTAGCCTTAA GCTT 144 

(2) INFORMATION FOR SEQ ID NO: 35: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

20 

(B) CLONE: fragment in pUR4421 coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

25 

AATTTAGCGG CCGCCCAGGT GAAACTGCTC GAGTAAGTGA CTAAGGTCAC CGTCTCCTCA 60 
GAACAAAAAC TCATCTCAGA ACAGGATCTG AATTAATGAG AATTCATCAA ACGGTCATA 119 

30 (2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(it) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: fragment in pUR4421 non-coding strand 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



50 



AGCTTATCAC CGTTTGATGA ATTCTCATTA ATTCAGATCC TCTTCTGAGA TGAGTTTTTG 60 
TTCTGAGGAG ACGGTGACCT TAGTCACTTA CTCGAGCAGT TTCACCTGGG CGGCCGCTA 119 



55 (2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(vii) IMMEDIATE SOURCE: 
(B) CLONE: Myc tail 
. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Glu Gin Lye Leu He Ser Glu Glu Asp Leu Asn 
15 10 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: BstEII-Hindlll linker coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GTCACCCTCT CCTCATAATO A 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: BstEll Hindlll linker non-coding strand 
' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AGCTTCATTA TGAGGAGACG 

(2) INFORMATION FOR SEQ ID NO: 40: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: (B) CLONE: primer cho03pcr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CGGATCCAAG CTTGAGCCTG GATGTCGGAC GAGATGAT 38 



Claims 

1. A process for carrying out an isolation process by using an immobilised binding protein or functional part thereof 
still capable of binding to a specific compound, wherein the immobilised protein is localized at the exterior of the 
cell wall of a host cell, wherein a medium containing said specific compound is contacted with a host cell under 
conditions whereby a complex between said specific compound and said immobilised binding protein is formed, 
separating said complex from the medium originally containing said specific compound, wherein the host cell is a 
fungus selected from the group consisting of yeasts and moulds, containing an expressible polynucleotide com- 
prising 

(i) a structural gene encoding the binding protein or a functional part thereof still having the specific binding 
capability, said binding protein or said functional part thereof being localized at the cell wall of said fungus, and 

(ii) at least a part of a gene encoding an anchoring protein capable of anchoring in the cell wall of said fungus, 
said part of a gene encoding at least the anchoring part of said anchoring protein, which anchoring part is 
derivable from the C-terminal part of said anchoring protein; 

said polynucleotide being present in a vector or in the chromosome of said fungus. 

2. A process according to claim 1 , wherein the fungus is selected from the group consisting of yeasts belonging to 
the genera Candida, Debaryomyces, Hansenula, Kluyveromyces, Pichia and Saccharomyces and moulds belong- 
ing to the genera Aspergillus, Penicillium and Rhizopus can be used. 

3. Process according to claim 1 or 2 wherein the protein capable of anchoring in the cell wall is selected from the 
group consisting of, AGA1 (=a-agglutinin) of S. cerevisiae, FL01 (= flocculation protein), Major Cell Wall Protein 
of lower eukaryotes, selected from the group consisting of yeasts and fungi. 

4. Process according to any of claims 1-3 wherein said fungus further comprises a sequence encoding a signal 
peptide ensuring secretion of the expression product of said first polynucleotide. 

5. Process according to any of claims 1-4 wherein the binding protein is an antibody, an antibody fragment, a com- 
bination of antibody fragments, a receptor protein, or an inactivated enzyme still capable of binding the correspond- 
ing substrate. 



Patentanspruche 

1 . Verfahren zum Durchfuhren eines Isolierungsverfahrens durch Verwenden eines immobilisierten Bindungsproteins 
oder eines funktionalen Teils davon, das (der) immer noch zur Bindung einer spezifischen Verbindung fShig ist, 
wobei das immobilisierte Protein an der AuGenseite derZellwand einer Wirtszelle lokalisiert ist, wobei ein Medium, 
das die spezifische Verbindung enthait, mit einer Wirtszelle unter Bedingungen in Kontakt gebracht wird, unter 
denen ein Komplex zwischen der spezifischen Verbindung und dem immobilisierten Bindungsprotein gebildet wird 
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und der Komplex von dem die spezifische Verbindung ursprQnglich enthaltenden Medium getrennt wird, wobei die 
Wirtszelle ein Pilz ist, ausgewShlt aus der Gruppe, die aus Hefen und Schimmelpilzen besteht, enthaltend ein 
exprimierbares Polynucleotid, umfassend 

5 (i) ein Strukturgen, das fur das Bindungsprotein oder einen funktionalen Teil davon, der immer noch die spe- 

zifische Bindungskapazitat aufweist, codiert, wobei das Bindungsprotein oder der funktionaie Teil davon auf 
der Zellwand des Pilzes lokalisiert ist, und 

(ii) zumindest einen Teil eines Genes, das fur ein Ankerprotein codiert, das zur Verankerung in der Zellwand 
10 des Pilzes fShig ist, wobei der Teil eines Genes zumindest fur den Verankerungsteil des Ankerproteins codiert, 

wobei der Verankerungsteil vom C-terminalen Teil des Ankerproteins ableitbar ist; 

wobei das Polynucleotid in einem Vektor oder Chromosom des Pilzes vorliegt. 

15 2. Verfahren gemSIi Anspruch 1 , wobei der Pilz aus der Gruppe ausgewShlt ist, die aus Hefen besteht, die zu den 
Genera Candida, Debaryomyces, Nansenula, Kluyveromyces, Pichia und Saccharomyces besteht und daruber 
hinaus Schimmelpilze verwendet werden kOnnen, die zu den Genera Aspergillus, Peniciilium und Rhizopus ge- 
horen. 

20 3. Verfahren gemaft Anspruch 1 oder 2, wobei das zum Verankern in der Zellwand fahige Proteine aus der Gruppe 
ausgewahlt ist, die besteht aus: AGA1 (=a-Agglutinin) von S. cerevisiae, FL01 (= Flocculationsprotein), Haupt- 
zellwandprotein von niederen Eukaryoten, ausgewahlt aus der Gruppe, die aus Hefen und Pilzen besteht. 

4. Verfahren gemSfi mindestens einem der Anspruche 1 bis 3, wobei der Pilz daruber hinaus eine Sequenz umfasst, 
25 die fur ein Sigalpeptid codiert, wodurch die Sekretion des Expressionsproduktes des ersten Polynucleotids sicher- 

gestellt wird. 

5. Verfahren gemSIS mindestens einem der AnsprQche 1 bis 4, wobei das Bindungsprotein ein Antikorper, ein Anti- 
koTperfragment, eine Kombination von Antikorperfragmenten, ein Rezeptorprotein oder ein inaktiviertes Enzym 

30 ist, das immer noch zur Bindung des entsprechenden Substrats fdhig ist. 



Revendications 

35 1. Processus adapte pour realiser un processus d'isolation en utilisant une proteine de liaison immobilisee ou une 
partie fonctionnelle de celle-ci toujours capable de se lier a un compose specifique, dans lequel la proteine immo- 
bilisee est situee a I'exterieur de la paroi cellulaire d'une cellule hdte, dans lequel un milieu contenant ledit compose 
specifique est mis en contact avec une cellule note dans des conditions ou un complexe entre ledit compose 
specifique et ladite proteine de liaison immobilisee est forme, separant ledit complexe du milieu contenant initia- 

40 lement ledit compose specifique, dans lequel la cellule hdte est un champignon choisi dans le groupe constitue 

par les levures et les moisissures, contenant un polynucleotide d'expression comprenant 

(i) un gene structure! codant la proteine de liaison ou une partie fonctionnelle de celle-ci ayant toujours la 
capacite de liaison specifique, ladite proteine de liaison ou ladite partie fonctionnelle de celle-ci etant situee 

45 au niveau de la paroi cellulaire dudit champignon, et 

(ii) au moins une partie d'un gene codant une proteine d'ancrage capable de se fixer a la paroi cellulaire dudit 
champignon, ladite partie d'un gene codant au moins la partie de fixation de ladite proteine d'ancrage, laquelle 
partie de fixation est susceptible de deriver de la partie C-terminal de ladite proteine d'ancrage ; 

50 ledit polynucleotide etant present dans un vecteur ou dans le chromosome dudit champignon. 

2. Processus selon la revendication 1 dans lequel il est possible d'utiliser le champignon choisi dans le groupe cons- 
titue par les levures appartenant aux genres Candida, Debaryomyces, Hansenula, Kluyveromyces, Pichia et Sac- 
charomyces et les moisissures appartenant aux genres Aspergillus, Peniciilium et Rhizopus. 

55 

3. Processus selon la revendication 1 ou 2 dans lequel la proteine capable de se fixer a la paroi cellulaire est choisie 
dans le groupe constitue par AGA1 (agglutinine A) de S. cerevisiae, FL01 (proteine de floculation), proteine ma- 
jeure de paroi cellulaire des eucaryotes inferieurs, selectionnee dans le groupe constitue par les levures et les 
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champignons. 

Processus seion Tune quelconque des revendications 1 a 3 dans lequel ledit champignon comprend en outre une 
sequence encodant un peptide signal garantissant la secretion du produit d'expression dudit premier polynucleo- 
tide. 

Processus selon Tune quelconque des revendications 1 a 4 dans lequel la proteine de liaison est un anticorps, un 
fragment d'anticorps, une combinaison de fragments d'anticorps, une proteine receptrice ou une enzyme inactivee 
toujours capable de se lier au substrat correspondant. 
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[Figure 19] 
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figure 20; 




55 



